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(57) Abstract 

A graphics system (Figures 1A - IB), includes triangle-engine (150) for real-time rendering into a displayable frame-buffer (140-141) 
of image data derived from vertex-base deferred instructions. The system uses homogeneity values (1/w values) for z-buffer-Iike occlusion 
mapping as well as for texture mapping. Depth resolution is enhanced for both occlusion mapping and texture mapping by representing 
(1/w), (u/w) and (v/w) values in a block-fixed format. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States 
applications under the PCT. 



AM 


Armenia 


AT 


Austria 


AU 


Austxalia 


BB 


Birbartm 


BE 


Belgium 


BF 


Buxfcxm Faso 


BG 


Bulgaria 


BJ 


Benin 


BR 


Brazil 


BY 


Belarus 


CA 


Canada 


CF 


Cential African Republic 


CG 


Congo 


CH 


Switzerland 


a 


Cote d' I voire 


CM 


Cameroon 


CN 


China 


CS 


Czechoslovakia 


CI 


Czech Republic 


DE 


Gcmtany 


DK 


Denmark 


EE 


Estonia 


ES 


Span) 


FI 


Finland 


FR 


France 


GA 


Gabon 



party to the PCT on the front pages 



GB 


United Kingdom 


GE 


Georgia 


GN 


Guinea 


GR 


Greece 


HU 


Hangar? 


IE 


Ireland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 


KG 


Kyrgystan 


KP 


Democratic People's Republic 




of Korea 


KR 


Republic of Korea 


KZ 


Kazakhstan 


U 


Liechtenstein 


LK 


Sri Lanka 


LR 


Liberia 


LT 




LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco 


MO 


Republic of Moldova 


MG 


Madagascar 


ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 



pamphlets publishing international 



MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


FT 


PormgaJ 


RO 


Romania 


RU 


Russian Federatkra 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TO 




TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


vz 


Uzbekistan 


VN 


Viet Nam 



WO 96/36011 



PCT/US96/06653 



10 



-1- 

Te^^Ma^ng H ° m ° geneitY Values for De P th for Occlusion Mapping and 



Invent org; 
Donald M. Gray, III 
Adam Mai amy 
Robert Laker 
Adrian Sfarti 
Nicholas Baker 

This is a Continuation-in-Part of U.S. Patent 
15 Application Serial No. 08/438,860, filed May 10, 1995 
(Attorney Docket No. MDIO4500) , entitled CONFIGURABLE 
VIDEO DISPLAY SYSTEM HAVING LIST-BASED CONTROL 
MECHANISM FOR TIME -DEFERRED INSTRUCTING OF 3D RENDERING 
ENGINE THAT ALSO RESPONDS TO SUPERVISORY IMMEDIATE 
20 COMMANDS, by inventors Adrian Sfarti, Robert Laker, 
Adam Malamy, Donald M. Gray, m, and Nicholas Baker, 
incorporated herein by reference. 

BACKGRonwn 
25 Field nf the TnwTiH ny, 

The invention relates generally to digital image 
processing and the display of digitally generated 
images. 



30 



The invention relates more specifically to the 
problem of creating raster-based, high- resolution 
animated images in real time with the aid of a 3D 
rendering engine. 

2- Description of t-fto ft elat-^ ftr * 
35 in recent years, the presentation and pre- 

presentation processing of visual imagery has shifted 
from what was primarily an analog electronic format to 
an essentially digital format. 
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Unique problems come to play in the digital 
processing of image data and the display of such image 



data. 



The more prominent problems include providing 
5 adequate storage capacity for digital image data and 
maintaining acceptable data throughput rates while 
using hardware of relatively low cost. In addition, 
there is the problem of creating a sense of realism in 
digitally generated imagery, particularly in animated 
10 forms of such imagery. 

The visual realism of imagery that is generated 
by digital video game systems, by simulators and the 
like can be enhanced by providing special effects such 
as, but not limited to, making real-time changes in the 
orientation and/or shadowing and/or highlighting of 
various objects, smoothing or sharpening the contours 
of various objects at different times, and so forth. 

Visual realism can be further enhanced by 
projecting 3 -dimensional (3D) surface definitions from 
a model space onto a 2 -dimensional (2D) image plane and 
rotating, scaling or otherwise manipulating the 
3 -dimensional surface definitions in real time prior to 
their projection onto the 2 -dimensional image plane. 

Visual realism can be additionally enhanced by 
increasing the apparent resolution of a displayed image 
so that it has a smooth photography- like quality rather 
than a grainy disjoined- blocks appearance of the type 
found in low- resolution computer -produced graphics of 
earlier years. 

Visual realism can be. even further enhanced by 
increasing the total number of different colors and/or 
shades in each displayed frame of an image so that, in 
regions where colors and/or shades are to change in a 
smooth continuum by subtle degrees of hue/intensity, 
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the observer perceives such a smooth photography- like 
variation of hue/intensity rather than a stark and 
grainy jump from one discrete color /shade to another. 

Although bit -mapped computer images originate as 
a matrix of discrete lit or unlit pixels, the human eye 
can be fooled into perceiving an image having the 
desired photography- like continuity if the displayed 
matrix of independently- shaded (and/or independently 
colored) pixels has dimensions of approximately 500-by- 
500 pixels or better at the point of display and a 
large variety of colors and/or shades on the order of 
roughly 24 bits-per-pixel or better. 

The human brain can be tricked into perceiving a 
displayed 2 -dimensional moving image as being somewhat 
15 3 -dimensional in nature if a sufficient number of cues 
are generated in real-time to support such perception. 
These cues include but are not limited to: 

(a) drawing images along angled lines of 
perspective to create a sense of depth; 
20 (b > shading images to simulate 3 -dimensional 

lighting effects including shadows and 
reflections ; 

(c) allowing displayed objects to rotate so as to 
show their side and back surfaces; 
25 (d) allowing displayed objects to appear to move 

forward and back relative to the viewer by 
appropriately scaling their size; and 
(e) allowing displayed objects to move in front 
of one another as if they were 3 -dimensional and 
had all the associated properties of the real 
objects they portray. 
The above set of visual cues imply that each rotating 
object having reflective surfaces needs to have its 
correspondingly surrounding 3D visual environment 
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wrapped about its surface, and distorted in accordance 
with the contours of its surface in order to create the 
illusion of 3 -dimensional reflection. The above set of 
visual cues further presuppose that translucent moving 
5 objects passing in front of other objects should appear 
to translucently pass through the imagery of the object 
behind. 

Carrying out all these 3D cuing operations in 
real-time can be quite complicated and difficult, 
10. particularly if an additional constraint is added that 
the implementing hardware has to be of relatively small 
size and low cost. 

Compound systems are being proposed that have 
real-time 3 -dimensional object defining and 
15 manipulating means as well as other means that contend 
for access to a shared system memory and for access to 
the shared resources of system CPU's. 

The compound nature of such systems places a 
strain on system memory to deliver (or to store) time- 
20 critical data to (or from) devices or modules that need 
to operate on a real-time basis. 

An example of time- critical data is video data 
that may be needed on a real-time basis, within the 
time window of a horizontal raster line for example, in 
25 order to provide real-time display for an interactive 
game or an interactive simulator. 

The compound nature of such systems also 
increases the likelihood that a minor software error 
(bug) in one software module will induce an unintended 
30 write to a system critical register or memory location 
and bring the whole system down. 

The proposed compound systems have so many 
hardware and software functionalities that they strain 
the throughput capabilities (data bandwidth) of the 
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memory- access management subsystem and complicate the 
tasks of the memory- access management subsystem. The 
memory-access management subsystem now needs to 
arbitrate among a larger number of contenders for 
5 memory access. 

The added functionalities of the proposed 
compound systems additionally strain the throughput 
capabilities and complicate the tasks of any system 
CPUs that have to supervise the activities of the image 

10 manipulating and rendering means on a real -time basis. 
(The term "CPUs" refers here to a general -purpose data 
processing subsystem which may be implemented either in 
a centralized unit format, as for example a single 
truly- central processing unit; or which may be 

15 implemented in a plural units format, such as in a 
parallel processing system.) 

A system architecture is needed for reducing 
contention among plural potential requesters for system 
memory access. 

20 A system architecture is needed for reducing 

contention by plural software modules for access to the 

limited resources of system CPUs. 

A methodology is needed for simultaneously 

satisfying the needs of multiple, time-critical 
25 processes such as those of a real-time video display 

subsystem and those of a real-time animation subsystem. 
A methodology is needed for reducing the 

likelihood that a wayward software or hardware module 

will bring down the entire system by unintentionally 
30 writing to a system critical register or memory 

location. 

In general, a goal of 3D computer graphics is to 
create a 2D projection on a cathode- ray tube ("CRT") 
screen of a three-dimensional model as viewed from a 



WO 96/36011 



PCT/DS96/06653 



-6- 



predetermined viewpoint in three-dimensional model 
space. One aspect of such a projection is the need to 
keep track of which objects are in front of other 
objects, and which are behind, when viewed from the 
5 viewpoint. This knowledge is necessary to ensure that, 
for example, a building in the foreground will properly 
occlude a building in the distance. This aspect of the 
rendering process is known as "occlusion mapping-. 

One popular technique to perform occlusion 
10 mapping uses a construct known as a -z buffer". A 
standard z buffer linearly associates a number called 
the -z value", representing the distance from the 
observer (depth in the scene relative to a projection 
plane) , with each pixel drawn on the screen. When a 
15 first object is projected, attributes of its pixels 
(such as color) are stored in a -frame buffer", and the 
z value associated with each pixel is separately stored 
in the z buffer. if a second object from the model 
subsequently projects onto the same pixel, the new 
20 object's z value is compared against the z value 
already stored for that pixel, and only if the new 
value is less (representing an object closer to the 
viewer) will the new pixel be drawn. 

Fig. 8 illustrates the rendering of Object 1 and 
25 Object 2 at different distances or z values from a 
projection plane or image plane, considered for 
purposes of illustration to be located at z=0 in the 
model space. In Fig. 8, object 1 is projected and 
rendered first. Object 2 is rendered second. The z 
30 buffer prevents pixels of object 2 from being written 
to the frame buffer in the locations where object 1 has 
already written pixels with a lesser z value. Thus, 
object 2 appears in the ultimately displayed image to 
be behind object l, as desired. 
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Z buffers can be implemented in either hardware 
or software. The numbers stored can be either floating 
point or integer values. Any number of bits can be 
devoted to the z values. in general, the more bits 
5 that are devoted to storing the z value, the finer the 
resolution in distance that can be achieved. Because 
z values represent the depth of an object in a scene, 
z values can be more generally referred to as "depth 
values", and z -buffers can be more generally referred 

10 to as 'depth buffers". Also, in particular 
implementations, depth values can be increasing with 
increasing depth, or can be decreasing with increasing 
depth. Since the invention is not restricted to one 
such implementation or the other, depth values 

15 sometimes are referred to herein as being "farther" or 
"nearer" to the viewpoint than other depth values. 

Another feature of 3D graphics Bystems is the 
ability to map a texture onto an object with a 
perspective- correct mapping, as seen in Fig. 9. In the 

20 simplest form, a texture can be thought of as a decal 
that is applied to the surface of an object, such as a 
design on the surface of a cube. In Fig. 9, 902 
designates the cube in a model space 903 and 904 
designates "texture space" containing a "texture map" 

25 905 which is to be applied to all three visible 
surfaces 906, 908 and 910 of the cube as part of the 
rendering process. Since the surface 906 of the cube 
902 is parallel to the projection plane (considered for 
purposes of this illustration to be at z=0 in the model 

30 space 903) , the texture map 905 can be applied directly 
onto that surface. Surface 908 is at an angle to the 
projection plane, so a perspective transformation of 
the texture map 905 is needed before applying it to the 
surface 908. Such a map, as transformed, is 



WO 96/36011 



PCT/OS96/06653 



-8- 

illustrated at 912. Similarly, a different perspective 
transformation of the texture map 905 is required 
before applying it to the surface 910; such a map, as 
transformed, is illustrated at 914. The final image, 
5 as projected onto the projection plane and with 
perspective- correct texturing applied, is illustrated 
as 916 in Fig. 9. it can be seen that the shape of the 
decal on the side of the box is warped as the box is 
rotated to preserve the illusion of three - 

10 dimensionality in the 2D projection. 

In order to achieve perspective -correct texture 
mapping, graphics rendering systems traditionally 
perform projection calculations using a 4x4 matrix 
representing the transformation to be performed. 4x4 

15 matrices are used also for many other kinds of 
transformations in the 3D model space, all as discussed 
in Foley et al. f "Computer Graphics, Principles and 
Practice," 2d. ed. (Addison-Wesley: 1991), especially 
at pp. 213-226 and 253-281. The entire Foley text is 

20 incorporated herein by reference. 

4x4 matrix transformations depend on the 
representation of points in the 3D model space using 
"homogenous coordinates", in which a fourth coordinate, 
w, is added to the traditional three spatial 

25 coordinates x, y and z. Two sets of homogenous 
coordinates are considered to refer to the same point 
in 3 -space if one is a multiple of the other. Thus 
(x,y,z,w) refers to the same point as (*, 1, 1, i) in 

'WWW 

which representation the fourth coordinate ("1") can be 
30 dropped. The process of dividing through by w (and 
optionally dropping the last coordinate) is referred to 
herein as the process of "homogenizing" the point, 
after which the representation is referred to as an 
"homogenized" representation of the point. Similarly, 
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the process of multiplying through by any non-zero and 
non-unity w value is referred to herein as the process 
of "de-homogenizing" the point, after which the 
representation is referred to herein as "de- 
5 homogenized". The value (1) is referred to herein as 
an "homogeneity factor", because the point is 
"homogenized" by multiplying each of the coordinates by 
(1). The value w is referred to herein as an 
"homogeneity divisor", because the point is homogenized 

10 by dividing- each of the coordinates by w. The term 
"homogeneity value" as used herein includes both 
homogeneity factors and homogeneity divisors, since it 
does not imply the function (e.g. multiplication or 
division) by which it is to be applied to the other 

15 coordinates to achieve homogeneity. As will be seen, 
the homogeneity value for a point is related to its 
depth in the scene. 

Thus the projection calculations traditionally 
performed naturally yield a homogeneity value ((1) or 

20 w) for each point projected onto an image plane. 
Traditional texture mapping, which maps a texture onto 
a planar polygon of the model, utilizes the homogeneity 
values as follows. 

Initially, each vertex of the model space polygon 

25 is assigned, in addition to attribute values and its 
model space coordinates (x,y,z), a pair of depth- 
independent coordinates (u,v) into a depth- independent 
texture space. The texture space is considered herein 
to be "depth -independent" because it is defined with 

30 only two Cartesian coordinates (u and v) . For each 
vertex, homogenized image space coordinates are 
calculated using homogenous transformations. This 
calculation yields the homogeneity value 1 for the 
vertex, which is applied to the depth- independent 
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texture space coordinates (u,v) for the vertex to 
generate "depth -adjusted" texture coordinates (1,1). 
These can be thought of as coordinates into a "depth- 
adjusted texture space". 
5 Next, for each i'th pixel of the polygon as 

projected onto the image plane, in addition to 
calculating its new image space coordinates (*i,Zi) by 
interpolation, the depth-adjusted coordinates* 1 ( 5. , Zi ) 
into the depth -adjusted texture space are "'also 

10 calculated by interpolation. The homogeneity value -i- 
is also interpolated for the i'th pixel from the 
homogeneity values of the polygon vertices. Because 
the predefined texture map is indexed in depth- 
independent texture space and not depth- adjusted 

15 texture space, the texture coordinates (h,Zl) for the 
i'th pixel are then converted back to depth- independent 
texture space by multiplying through by w ; . This yields 
depth- independent coordinates (u f ,v,) into depth- 
independent texture space. The corresponding texture 

20 value T,- can then be retrieved and/or calculated from 
the predefined texture map and applied to the polygon 
as projected onto the image plane. 

In general, at least for one-point perspective 
projections onto an image plane perpendicular to the z 

25 axis, the homogeneity value produced by the 4x4 matrix 
calculation is related to the depth coordinate z of the 
point in model space by a linear relationship of the 
form 

w = az+P, 

30 where a and p are constants which depend on such 
variables as the position of the image plane on the z- 
axis in model space and the chosen center of projection 
(COP) . For projections onto the plane z=d with 
COP = (0,0,0), it can be shown that w = 1. 
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But while this single relationship between z and 
w is well known, conventional systems do not use 
homogeneity values for occlusion mapping, only for 
texture mapping, in order to maximize texture -mapping 
5 resolution in a given graphics rendering system, most 
conventional systems texture map each polygon 
separately. This enables the homogeneity values for 
the polygon (including its vertices and each of its 
interior pixels) to be scaled so as to occupy the full 

10 range of numeric values that can be carried by the 
hardware. The above texture mapping procedure is 
performed using these full-scale (1) values. 
Accordingly, while the homogeneity values of different 
surface regions of the polygon might have been 

15 initially related to the depth of the surface region in 
the overall rendered scene, after scaling, they are 
related only to their depths in the scene relative to 
the other surface regions of the same polygon. 

Thus, a traditional 3D graphics system computes 

20 two parameters for each pixel, a z value which is used 
in the z buffer, and a 1/w value which is used for 
perspective -correct texture mapping. The two are 
treated entirely independently. Additional complexity 
and cost is required in computing both the z value and 

25 1/w value. Further, computing both values, rather than 
one of the values, limits the speed with which an image 
may be rendered and displayed. Therefore, it is 
desirable to provide an apparatus and method in which 
only one of the two values need be calculated, but 

30 without significant loss of precision in either the 
texture-mapping or occlusion mapping operations. 
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SUMMARY QF THE TWVENTXON 
The above-mentioned problems are overcome in 
accordance with the invention by providing a graphics 
system that has a mult i -port memory subsystem storing 
image- rendering control lists and that further has a 
real-time programmably- configurable render-engine (also 
referred to herein as the » triangle -engine' or l TE') 
coupled to the memory subsystem and responsive to the 
stored image -rendering control lists as well as to 
supervisory immediate control commands issued by a 
supervisory CPU or the like. 

A triangle- engine in accordance with the 
invention reduces the load on system memory and on 
system CPU's by periodically fetching compact render- 
15 control lists and source image data from the memory 
subsystem on a list basis and by processing the fetched 
information without requiring continuous real-time 
intervention by system CPU's. This frees the system 
CPU (or CPU's) for managing other time- critical tasks 
of the graphics system. 

One embodiment of a triangle- engine {TE) in 
accordance with the invention includes an addressable 
register set that is programmable by one or both of a 
system CPU and a memory- resident control -list for 
programmably reconfiguring the TE in real-time to 
perform various image rendering operations such as 
surface color fill, surface texturing and hiding of 
overlapped surfaces. 

The one embodiment of the triangle -engine (TE) 
includes a linked-list instructions - fetch means (also 
referred to herein as a 'vertex- fetch' module or »VTX') 
for fetching from system memory, a compact set of draw 
instructions that are based on 3 -dimensional vector - 
point models of object surfaces to be rendered and 
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displayed. 

'Rendering' refers herein to the step of writing 
data into a frame-buffer portion (FB) of memory 
irrespective of whether that FB is the currently- being - 
5 displayed 'active' FB or a later -to -be -displayed 
alternate FB. 

'Displaying' refers herein to the step of reading 
data from an active frame-buffer and transforming the 
read data into a video signal for viewing on a video 
10 monitor or the like. 

The TE embodiment further includes control 
derivation means (also referred to herein as a 'set-up' 
module) for deriving span-walking control data from the 
vector -based modeling data fetched by the vertex module 

15 so that lines and/or areas bounded by vertex points may 
be stepped across and so that bounded areas may be 
filled with corresponding colors in accordance with a 
bi- linear interpolation scheme. 

The TE embodiment additionally includes an edge- 

20 walker module (EWK) , responsive to edge-walking control 
data (supplied from the set-up module or elsewhere) , 
for defining by means of interpolation, various image- 
defining parameters distributed at points along a first 
polygon edge (line) having a respective set of two pre- 

25 defined vertex points, and for defining by means of 
similar interpolation, a subset of similar parameters 
distributed at points along a second polygon edge 
(line) having a respective set of two pre-defined 
vertex points. (In one embodiment, the point parameter 

30 subset of the second walked edge consists of only x and 
y for each point while the parameter set of the first 
walked edge includes, in addition to x and y, further 
point parameters, 1/w, R, G, B, A, u/w, and v/w.) The 
edge- walker module also determines which frame -buffer 
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pixels are to be considered 'inside' a walked polygon 
and which are to be deemed 'outside'. Rendering 
activities for a given polygon are limited to frame- 
buffer pixels 'inside' the walked polygon. m an 
5 embodiment, the polygons which the TE can render are 
limited to triangles. 

The image -defining parameters (also sometimes 
referred to herein as attribute values) of points of a 
walked triangle include but are not limited to: 
10 (a) native -surface color component values such 

as R, G, and B that are respectively 
attached to each point; 

(b) a blending factor referred to herein as 'A' 
that is respectively attached to each point 
and is used for blending the native -surface 
color component values with respective prior 
frame-buffer values and/or respective 
texturizing values; 

(c) depth-adjusted texture mapping coordinates 
that are respectively attached to each point 
and are referred to herein as 'u/w' and 
'v/w' , where u and v are coordinates of 
bitmapped 2 -dimensional texture image 
defined using depth- independent coordinates, 
the texture image having its own R, G, B, 
and/or A values for each of its points; and 

(d) a homogeneity factor referred to herein as 
'l/w' . 
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The TE additionally includes a span-walker module 
(SWK) , responsive to span-walking control data 
(supplied from the set-up module and/or from the edg&- 
walker module or elsewhere) , for defining by means of 
bi-linear interpolation, various image -defining 
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parameters along a third line having two pre-defined 
endpoints. The endpoints of the third line are usually 
defined by the edge-walker module as the edge-walker 
module steps pair-wise to the next set of points along 
5 the vertex- defined first and second edges of the 
triangle. The image -defining parameters for interior 
points (IP) of the triangle that are similar to those 
of the edge-walker module (R, G, B, A, u/w, v/w and 
l/w) . Span- walking may be thought of as a pre- render 

10 parameter filling operation for an area between two 
edge lines of the polygon. 

The TE embodiment additionally includes a 
texture-mapping module (TXT) , responsive to span-walker 
result data (supplied from the span-walker module or 

15 elsewhere), for defining what texture pixel (s) are to 
be blended, if at all, with the native pixels of a 
span-walked area. 

The TE embodiment additionally includes a data 
blending module (DBL) , operatively coupled to the span- 

20 walker module (SWK) , to the texture -mapping module 
(TXT) , to a frame-buffer (FB) and to a depth buffer 
(1/w-buffer) , for blending together the outputs of the 
SWK and the TXT and the current contents of the FB and, 
depending on a programmably- definable comparison of a 

25 current-pixel depth-value with a corresponding value 
already in the 1/w-buffer, writing the blended value 
back to the frame-buffer (FB) . 

A render- control list is provided within system 
memory in accordance with the invention for controlling 

3 0 the TE . The render -control list includes a block- 
header that defines the size (instructions -count) of a 
succeeding list of instructions and the nature of the 
instructions (e.g., long- format versus short - format ) . 
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The block-header includes a hardware version- 
number field for identifying versions of hardware that 
may or may not execute the succeeding list of 
instructions . 

5 The block- header also includes a vertex replace - 

mode field for controlling automatic generation of 
% strip' and x fan' forms of triangle chains. 

The block- header further includes function 
enable/bypass fields for programmably enabling on a 

10 block-by-block basis, a shading function that applies 
native shades to portions of the to -be -rendered area, 
a texturing function that applies textures to portions 
of the to -be -rendered area, and a perspective weight 
and use function that applies normalized perspective 

15 values to portions of the to-be- rendered area and that 
uses those normalized perspective values for occlusion 
mapping. Processing time can be shortened by bypassing 
those functions that have no substantial effect on the 
final rendering. Data structures within the block 

20 contract accordingly. 

For example, if all texture blend factors of 
a to -be -applied texture are such that the texture 
bitmap will essentially hide all of the surface native 
coloration, then the native -surface shading function 

25 should be bypassed to avoid wasting time on 
computations that are of no essential consequence to 
the final rendering, 

A variety of rendering effects can be obtained 
by, for example, modulating the various fields of each 

30 block-header. Further rendering effects can be 
obtained by re -arranging the order in which vertex 
information is listed in the block. Yet further 
rendering effects can be obtained by reconfiguring the 
settings of the control register set in the triangle- 
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engine (TE) . These and other options will become 
apparent in the detailed description below. 

One of the notable features of the triangle - 
engine (TE) is that certain of its control registers 
5 nay be modified only by a command- giving device (e.g., 
CPU) that is deemed to be a 'privileged supervisor' as 
opposed to another accessing device that is not 
privileged. If a non- privileged device tries to modify 
a privileged register, an access violation interrupt is 

10 generated by the TE. This prevents wayward software 
from taking control of the TE (via the deferred 
instructions path) and directing the TE to write over 
unintended, crucial areas of system memory, thereby 
crashing the system. 

15 in an aspect of the invention, the 1/w value is 

used instead of a z value to implement a depth buffer 
("l/w buffer") which provides the same occlusion- 
mapping functionality as a traditional z buffer. 
Significantly, this allows a hardware or software 

20 graphics system to handle only one value, l/w, instead 
of both a l/w and a z value, resulting in a saving in 
complexity which in turn results in a cheaper and/or 
smaller and/or faster graphic system. 

Since the l/w buffer uses numbers that are 

25 roughly inversely proportional to z values, instead of 
linearly proportional, it is not immediately apparent 
that the l/w buffer will perform the intended function 
of discriminating between objects that are at different 
distances from the observer. A scaling technique is 

30 used and described below. 

Use of homogeneity value l/w also restricts 
manipulation of the l/w parameter for optimal usage of 
hardware data path widths for texture mapping, a 
hardware and software protocol that circumvents the 
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problems caused by these restrictions is also provided 
and described below. 

The use of the homogeneity value for occlusion 
mapping is better illustrated with reference to Pig. 
10, which illustrates conventional rendering of objects 
1002 and 1004 from a model 1006 using a 16-bit 
canonical z buffer. With 16 bits, the decimal numbers 
0 through S5535 are represented. Minimum and maximum z 
values or distances are defined such that all objects 
that will be included in the scene will be between 
these two limits. These distances are known as a 
-hither" value and a "yon" value. The hither z is 
assigned a z buffer value of o, and the yon z is 
assigned a z buffer value of 65535. The interval 
15 between hither and yon is split into 65536 pieces of 
equal size. The z value of any point on any object, in 
the scene is rounded to the nearest integer between 
hither and yon. For example, a point that was exactly 
half way between hither and yon would be given the z 
20 value of 32768. 

The resolution of a z buffer is defined as the 
smallest separation in z that can be distinguished in 
the z buffer. For example, in Fig. 10, assuming that 
the hither value is at Z=0 and the yon value is at 
Z=66535, there may be two points of interest in the 
scene, one at Z=ll05, the other at Z-1105.4. Both of 
these points will be represented with a value of 1105 
in the z buffer, since there are only enough bits (16 
in this example) to represent integers. 

The resolution of the z buffer can be increased 
by using more bits in the z buffer, but only at the 
expense of cost, speed and complexity of the system. 
The resolution can be increased also by reducing the 
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distance between hither and yon, but then the nearest 
and farthest objects of the model will be cut off. 

In order to use homogeneity factors for occlusion 
mapping, the l/w parameter must provide the same 
5 information as the z parameter for the purposes of 
determining the relative distances of objects along the 
z-axis. Additionally, the resolution of the l/w buffer 
must be sufficient for use in a 3D graphics system. 

The assertion that the l/w buffer provides the 
10 same functionality as the z buffer is easily proven by 
noting that (1) is related monotonically to z because 

= -ah^' lcc 311(1 P are fixed for a given 
projection.) This implies some restrictions on the 
handling of l/w, which are discussed below. if the 

15 restrictions are observed, as in the equation above, 
the l/w parameter can be used to discriminate objects 
in z space just as well as the z parameter can. 

In order to resolve the l/w buffer resolution 
problem, it is first necessary to note that the 

20 resolution of the l/w buffer is not the same as the 
resolution of the z buffer. Pig. n illustrates the 
differences in resolution between a typical z buffer 
and the l/w buffer. The resolution of the z buffer is 
constant throughout its range. Hence the term "linear" 

25 z buffer. This is illustrated as horizontal line 1102 
between the hither and yon z -values in the left-hand 
graph in Pig. n. The resolution of the l/w buffer, 
however, is not constant over the range of values 
between 0 and 1, but rather decreases as z increases. 

30 For example, assume that a 16 -bit l/w buffer is 

implemented. All l/w values will fall in the range of 
1.0 (the nearest point) to about 0.0 (the farthest 
point). The range between l.o and 0.0 is divided into 
65536 equal parts, just as with the prototypical z 
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buffer. Thus, each increment in the l/w buffer 
represents 1/66536 of the range between 1.0 and 0.0, 
which is 1.0/65536 - .000015. 

Table I shows the l/w values for pairs of 
adjacent z values. The third column shows the delta 
between the l/w value for particular pairs of rows in 
the table. Note that the deltas in l/w get smaller for 
constant deltas in z as z increases. 



10 



TABLE I 



16-bit l/w buffer, hither at z=i 



B z value 


l/w value 


delta l/w 


l 


1 


0.5 


2 


1/2 = 0.5 


10 


l/iq = o.i 


0.01 


I 11 " 


1/11 =0.09 


I 100 


1/100 =0.01 


0.0001 


I 101 


1/101 - 0.0099 


J . 1000 


0.001 


0.000001 I 


I 1001 


0.000999 



15 



20 



25 



30 



Since each increment in the l/w buffer represents 
a numeric interval of 0.000015, it can be seen from the 
table that approximately .0001/. 000015 = 7 different 
depths can be distinguished between z=l00 z=10l. 
In other words, the resolution of the l/w buffer at 
z=100 is approximately 1/7 = 0.14. For example, 
imagine that the units of z are measured in feet. At 
a depth of 100 feet into the scene, objects that are 
separated by more than 0.14 feet (about two inches) can 
be resolved. For objects which are less that two 
inches apart depthwise, it will be impossible to 
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determine with the plain l/w buffer which object is in 
front . 

In contrast to the above, the delta in l/w 
between z-10 and z-u is about 0.01/0.000015 = 700. 
5 This means that 700 different depths can be 
distinguished between z=10 and z=ll using the l/w 
buffer. Again, if units in z are interpreted as feet, 
then at a depth of 10 feet, objects that are separated 
by more than 1/700 or 0.0014 feet (or about 3/160 of an 

10 inch) can be resolved. The resolution at z=l0 
therefore is much greater than the resolution at z=100. 

Thus, the resolution of the l/w buffer decreases 
with increasing z, as shown in Pig. n. 

In Fig. 11, horizontal line 1104 indicates an 

15 "acceptable resolution" for a 3D graphics system. 
Resolution better than the resolution represented by 
this line (i.e., above this line in Pig. n) results in 
satisfactory quality of rendered images, whereas 
resolution below this line is unacceptable. The exact 

20 location of the line 1104 depends on the design goals 
of the system. In the l/w buffer, because resolution 
decreases with increasing z, the resolution may drop 
below the acceptable level above some z values. 

To a first approximation, it will be appreciated 

25 that a depth buffer which decreases in resolution with 
increasing distance is not necessarily bad. In fact, 
it meshes well with human vision in terms of things 
near and far. Human beings tend to have better depth 
perception for objects close by than for those far 

3 0 away. A form of visual realism is therefore inherently 
created by using a digital word of finite bit width for 
storing the depth factor l/w. its resolving ability 
inherently mimics the ability of the human vision 
system to see depth perception at better resolution 
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close up than far away. In other words, the true level 
of acceptable resolution might vary with depth 
similarly to the 1 buffer resolution (see dotted curve 
1108 in Fig. li) . The acceptable level of resolution 
5 curve 1108 may or may not cross the actual 1 buffer 
resolution curve 1106. 

However, the similarity of the acceptable level 
of resolution curve to the actual 1 buffer resolution 
curve 1106 really applies only in scenes intended to 

10 represent great depths (e.g., depths measured in 
hundreds of feet) . In scenes depicting models extending 
only a few inches or feet in depth, human vision would 
resolve all such depths equally. The acceptable level 
of resolution curve for such a scene would more likely 

15 be flat, such as line 1104. While the concept of a 
plain 1 buffer works well for many scenes, therefore, 
it would be desirable to enhance the concept to provide 
acceptable depth resolution in all scenes. 

Accordingly, in another aspect of the invention, 

20 a new method of normalizing and storing the 1/w values 
in the l/w buffer that provides increased resolution 
over a desired range of z values is provided. The 
following discussion will focus on a hardware 
embodiment of normalizing a 16 bit l/w buffer. 

25 However, the techniques are applicable to software 
embodiments and also hardware embodiments with 
different size l/w buffers. 

The key to the normalization technique is to 
consider the entire range of 1 values to be made up of • 

30 a number of sub- ranges. In general, these sub -ranges 
do not need to be distinct. Because values in digital 
hardware are quantized, it will be appreciated that 
each of the 1 sub- ranges contains only discrete values. 
Then each 1 value can be represented with two numbers 
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(e,s), where e, referred to herein as the "w- range" 
value, indicates which of the sub- ranges to look in, 
and s, referred to herein as the significand, indicates 
which discrete value within the e'th sub-range is the 
5 desired 1 value. 

w 

In an embodiment, four sub- ranges are used, such 
that e can take on values 0-3. Also in the embodiment, 
the discrete values within each sub -range divide the 
sub-range equally, and instead of having to look up the 

10 s'th value in the e'th sub-range, s is considered to 
range from 0 to 1 and the 1 value represented by (e,s) 
can be calculated from e and s according to the 
function 1 - s x 2* be , where b is a parameter which can 
in general vary with e. Still further, b is chosen in 

15 the embodiment as b-3 for all e, such that 1 = s x 2* 3e . 
Thus if 16 bits are available for storing depth values 



20 



ina i buffer, the 


16 bits might be divided as shown in 


Table II: 




TABLE II 


EXAMPLE 


BLOCK- FIX REPRESENTATION 


| e (2 bits) 


s (14 bits) | 


15 14 


13 


0 



The representation of 1 values in the form (e,s) 
25 is referred to herein as a "block- fix" representation. 
Block- fix representation of numbers is known in 
general, but it has not previously been used to 
represent depth values for occlusion mapping, or to 
represent homogeneity values for texture mapping. Pull 
30 floating point representation has been used in the past 
to represent homogeneity values for texture mapping, as 
has full fixed point representation. Similarly, full 
floating point representation has been used in the past 
to represent linear z-buffer values, as has full fixed 
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point representation. Pull floating point 

representation of l/w values as described herein would 
make the hardware prohibitively expensive for low- cost 
implementations . As explained in more detail below, 
5 the use of a block- fix representation achieves nearly 
the same benefits as full floating point 
representation, and yields surprising advantages when 
used for texture mapping in particular. 

In the embodiment, 1 values are passed to the 

10 triangle engine in full floating point format (8 -bit 
exponent and 24 -bit mantissa) . The full 1 range is 0 
to 1. Table III shows how various numbers are 
represented in traditional fixed-point format, and in 
the block -fix representation or format according to the 

15 present invention. Notice that for smaller numbers, 
many significant bits are lost in the fixed-point 
representation because of the leading 0 bits . The 
block-fix format allows these leading bits to be 
represented in the exponent, leaving more room in the 

20 significand for significant bits of data. 



WO 96/36011 



PCT/US96/06653 



-25- 



TABLE Hi 



5 



Range 


sample 
1/w 


Fixed- Point 
Representation 


Block -Fix 
Representation 


w- range 


significand 


8 w 


.5 


.1000000000000000 


00 


.10000000000000 


.25 


.0100000000000000 


00 


.01000000000000 


.125 


.0010000000000000 


00 


.00100000000000 


1 i 1 < 1 
64 w 8 


.0625 


.0001000000000000 


01 


.10000000000000 


.05 


.0000110011001100 


01 


.01100110011001 


1 1 
512* v 

<^ 


.01 


. 0000001010001111 


01 


.10100011110101 


.005 


.0000000101000111 


10 


.01010001111010 


w 512 


.001 


.0000000001000001 


10 


.10000011000100 


.0005 


.0000000000100000 


10 


.01000001100010 


.0001 


.0000000000000110 


11 


.00001101000110 



The resolution of the l/w buffer with normalized 
10 l/w values is shown as curve 1202 in Fig. 12. Because 
two bits are devoted to the exponent, the initial 
resolution for a normalized 1 buffer is less than that 
of the fixed-point ("plain") l/w buffer having the same 
data width. As the z value increases, the resolution 
15 falls off with the same kind of curve shown as 1106 in 
the graph in Fig. 11. However, when the z value gets 
large enough to switch to the next 1 sub -range, the 
resolution is bumped back up, and starts falling off 
again. After all possible sub- ranges have been used, 
20 the resolution proceeds to decrease monotonically. 

Note that as z increases and approaches the 
beginning of the next sub -range in 1, there is no 

w 

inherent reason that e be incremented and the next sub- 
range be in fact used. Instead, it is actually 
25 desirable in certain circumstances to remain in the 
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same sub-range (albeit with decreasing depth 
resolution) . This is possible because the sub- ranges 
chosen for the block-fix representation in the present 
invention overlap (i.e., are not distinct). Thus a 
single (1) value can be represented in the present 
embodiment with more than one (e,s) pair, although 
depth resolution is best in the representation in which 
e is highest. 

Although the specific description in the 
preceding paragraphs refer to a 16 -bit depth buffer, 
with two bits used for e and 14 bits used as the 
significand, it will be readily apparent to those 
skilled in the art that l/w buffers of different bit 
widths may be implemented. Further, different subsets 
15 of the total bits may be assigned to the e and s 
fields. The choice will depend on the required 
precision/resolution of the system. 

In a typical 3D graphics system that does not use 
homogeneity values for occlusion mapping, there is some 
20 freedom to adjust the l/w parameter associated with 
each pixel independently of the z value. Specifically, 
perspective- correct texture association works well if 
each individual object (e.g., triangle) in a scene is 
first scaled with respect to l/w such that the l/w 
25 value of the nearest point (the point with the smallest 
z value) is 1.0. The scaling is accomplished by 
multiplying l/z by a constant j , where j can be 
determined differently for each object in the scene. 
Fig. 13 illustrates scaling a l/w parameter on an 
30 object by object basis. The advantage of such scaling 
is that the precision requirements of the software or 
hardware that implement the texture calculations are 
reduced, since numbers have been scaled to their 
optimal values. 
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When using the homogeneity value for occlusion 
mapping as well, it is no longer permissible to scale 
1/w independently for each object. Instead, any 
scaling which takes place must be uniform for all 
5 objects subject to occlusion mapping. Otherwise, it 
would no longer be possible to determine the relative 
depths of one object to the other, only the relative 
depths within a single object. Pig. 14 illustrates 
using a constant scaling factor k according to the 

10 present invention. 

Because of this restriction, the use of 
homogeneity values for both occlusion mapping and 
texture mapping increases the precision requirements 
for the hardware or software that handles the texture 

15 association calculations. In yet another aspect of the 
invention, the effect of this increase is minimized 
through the use of a technique to reduce the complexity 
of these calculations, as described below. 

At each i'th image region (e.g. pixel), in a 

20 typical 3D graphics system, a pair of texture 
coordinates u f and v f are calculated to determine what 
region (s) of texture space (e.g. which texel(s)) should 
be mapped onto this pixel. The calculation is as 
follows, where i, and Zi are already known for each 

25 pixel: 1 1 

V./V. 

However, if each of the ± have been normalized using 
the block- fix representation described above, this 
calculation is complicated by requiring ± to be de- 
normalized prior to determining u f and v. . 1 

The requirement for de- normalizing the 1/w value 
can be removed if the u/w and v/w values are also 
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normalized by the same amount as l/w for each vertex 
prior to rendering each triangle. The process of 
normalization can be thought of as scaling by a 
constant m. (In the present embodiment, m=2 3 *.) In. this 
5 case, at each pixel we have ±*m(±), and 4 «■<■£) and 
-± = a(-l) . if we perform the same calculations as 
before, namely: 

u = . mlu *'"J and v - a( V y i ) 

10 a ffld/w,) i - {1/w /, - in (i/w.) ' 

it is apparent that the constant of normalization, m, 
by virtue of its presence in both the numerator and 
denominator of the calculation above, cancels out. The 
result of the operation is u f and v s , as desired. The 

15 need for denonnalizing l/w for the calculation of 
texture coordinates has been eliminated. 

Surprisingly, therefore, the use of the inventive 
block- fix representation to enhance resolution when 
homogeneity values are used for occlusion mapping, does 

20 not necessarily increase hardware conplexity for the 
use of the same block- fixed homogeneity values for 
texture mapping. Moreover, an additional benefit of 
the normalization of u/w and v/w at the triangle 
vertices is that more precision is maintained in these 

25 parameters during interpolation, in the same way that 
normalizing l/w increases its precision as described 
above . 

The above and other features of the graphics 
system will become clearer by referring to the below 
30 detailed description. 
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BRIEF DEfiC7PTP T i ON QF toe drawtnGS 

The detailed description below makes reference to 
the accompanying drawings, in which: 

Fig* 1A is a block diagram of a graphics system 
5 in accordance with the invention that includes a 
programmably- configurable triangle -engine (TE) ; 

Fig. IB is a close-up block diagram showing more 
details of a triangle -engine (TE) in accordance with 
the invention and the parts of the graphics system with 
10 which the TE interacts; 

Fig. 2A is a perspective view for explaining how 
object -approximating 3D frameworks are formed with 
vertex points and how projective perspective is 
automatically compensated for by digital words of 
15 finite bit -width representing the value 1/w; 

Fig. 2B is a perspective view showing a possible 
native coloration of the framework surface; 

Fig. 2C is a perspective view for explaining bi- 
linear interpolation across the surface of a 3D 
20 triangle; 

Fig. 2D is a perspective view for explaining the 
mapped blend- in of texture plane information onto the 
object surface; 

Fig. 3 is a 2D plot for explaining how an edge- 
25 walker steps through frame-buffer space, down a long 
edge of a triangle and defines render start and stop 
pixels for a subsequent span- walker. 

Fig. 4 is a block diagram showing pipelined 
subdivision of tasks between the CPU and the triangle 
30 engine; 

Fig. 5 is a timing diagram showing pipelined 
generation of deferred instructions and later execution 
of same by the triangle engine; 
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Pig. 6 shows the data structure of a group of 
short -format vertex instruction blocks and a 
corresponding block- defining header; 

Fig. 7 illustrates how a deferred instructions 
list is constructed at a tail end while a higher up 
portion is simultaneously executed by the triangle- 
engine; 

Fig. 8 illustrates a rendering of two graphic 
ob j ects ; 

Fig. 9 illustrates a texture mapping of a texture 
onto a surface of a graphic object; 

Fig. 10 illustrates rendering objects in a scene 
using a 16 -bit canonical z buffer; 

Fig. 11 illustrates the comparative resolutions 
15 of the z buffer and l/w buffer values; 

Fig. 12 illustrates depth value resolution for 
l/w values .normalized according to an aspect of the 
present invention; 

Fig. 13 illustrates scaling of l/w parameters on 
20 an object-by- object basis; 

Fig. 14 illustrates scaling of all l/w parameters 
by a constant scaling factor k; 

Fig. 15 is a block diagram illustrating the major 
functional blocks of the set-up engine 152 {Pig. IB) ; 
25 Fig. 16 is a block diagram of the major 

functional units of the edge walker 153 (Pig. IB) ; 

Fig. 17 illustrates a triangle edge for 
calculation of a characteristic function; 

Pig. 18 illustrates the major functional blocks 
of the color data path of Pig. 16; 

Fig. 19 is a block diagram illustrating the major 
functional blocks of the span walker 154; 

Fig. 20 illustrates the major functional blocks 
of the uv data path of Fig. 16; and 
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Figs. 21A and 21B are a flow chart of steps 
performed in the rendering of a model. 

DETAILED DESCRTPTTQ TJ 

5 Referring to Fig. LA, a block diagram of an image 

processing, rendering and display system 100 (graphics 
system 100) in accordance with the invention is shown. 

A key feature of system 10 o is that it is 
relatively low in cost and yet it provides mechanisms 
10 for handling complex, 3 -dimensional (3D) image scenes 
in real time and displaying them such that they appear 
to have relatively high resolution and a wide variety 
of colors and/or shades per displayed frame. 

This feature is made possible in part by 
15 including within the system 100, a triangle -engine (TE) 
150 that renders image data into a frame-buffer (FB) of 
system memory on a real-time basis. 

This feature is further made possible by 
including within the system 100, an Image -Enhancing And 
20 Rendering Subsystem (I-EARS) comprised of: a video 
post -processor (VPP) 160, a digital encoder (DENC) 170, 
and a plurality of digital -to- analog converters (DAC's) 
175 . 

The triangle- engine (TE) 150 and the image - 
25 enhancing and rendering subsystem (I-EARS) 160-170-175 
may each be formed within one integrated circuit (IC) 
chip or may be distributed across a few IC chips. In 
one particular embodiment, elements 150, 160, 170 and 
175 are implemented within one integrated circuit (IC) 
30 chip together with elements 115, 120, 121, 122, 123, 
124, and 148 of Fig. 1A. This one IC chip is referred 
to herein as the ' I-EARS chip'. 

Within the I-EARS group of elements, 160-170-175, 
there is provided one or more of : a set of user- 
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programmable Color Lookup Table modules (GLUT'S, not 
shown) for defining on a per scanline basis, a palette 
of colors available for display; an image dithering 
circuit (not shown) ; and an image filtering circuit 
5 (not shown) . The functions of the image dithering 
circuit (not shown) and image filtering circuit (not 
shown) can be programmably bypassed on a by-the-pixel 
basis. Also optionally provided, are a hardwired 
pseudo- linear CLUT circuit (not shown) which can be 

10 programmably substituted for one of the user- 
programmable CLUT's on a by-the-pixel basis; and a 
user -programmable resolution- enhancing interpolator 
(not shown) which can be used to produce twice as many 
unique display lines and/or twice as many unique 

15 display columns as there are respective lines and 
columns in the active frame-buffer. A more detailed 
description of the I -EARS group of elements, 160-170- 
175, may be found in the co-pending U.S. Patent 
Application Serial No. 08/438,630, filed May 10, 1995, 

20 entitled CONFIGURABLE VIDEO DISPLAY SYSTEM HAVING LIST- 
BASED CONTROL MECHANISM FOR BY- THE -LINE AND BY-THE- 
PIXEL MODIFICATION OF DISPLAYED FRAMES AND METHOD OF 
OPERATING SAME, by inventors R. Thaik, et al. (Attorney 
Docket No. MDIO 4510) , which is incorporated herein by 

25 reference and as whose details will therefore not be 
repeated here. 

The. operations and importance of the triangle - 
engine (TE) 150 and various data blocks within system 
memory may be better appreciated by first considering 

30 the video processing operations of graphics system 100 
in an overall sense. 

System 100 includes a real-time video display 
unit (VDU) 180 such as an NTSC standard television 
monitor or a PAL standard television monitor or a 640- 
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by-480 VGA computer monitor or a higher- resolution 
monitor. The VDU 180 is used for displaying high- 
resolution animated images 185 to a system user 190. 

The video display unit (VDU) 180 may also include 
5 audio output means (not shown) for simultaneously 
producing and supplying corresponding multi- phonic or 
monophonic sound to the system user 190. 

As the image- enhancing and rendering subsystem 
(I-EARS) 160-170-175 drives the video display unit 

10 (VDU) 180, a system user 190 observes the displayed 
imagery 185 and hears any accompanying audio by way of 
audio- visual coupling path 191. An interactive 
response mechanism 195 is provided in the form for 
example of a joystick and/or a pad of push-buttons for 

15 allowing the system user 190 to feedback to the 
hardware (e.g., CPU 110) his or her real-time responses 
to the perceived audiovisual show by way of interface 
105. 

The image -enhancing and rendering subsystem 
20 (I-EARS) 160-170-175 has a pipelined structure that 
moves data signals synchronously from an upstream 
portion to a downstream portion. The downstream 
portion of the I-EARS 160-170-175 is clocked by a video 
clock generator (vidCLK) 167 that operates according to 
25 pre- specified pixel and control rates of the VDU 180. 
In one embodiment, the vidCLK rate is approximately 15 
MHz. 

The digital encoder (DENC) 170 includes a timing 
section that generates frame synchronization signals 
30 such as vertical synch pulses (V-synch) and horizontal 
synch pulses (H- synch) in synchronism with the vidCLK 
167. 

The upstream portion of the I-EARS 160-170-175 is 
clocked by a system clock generator (sysCLK) 117 that 
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operates according to a pre- specified system rate. The 
system rate is typically different from the vidCLK 
rate. In one embodiment, the sysCLK rate is 
approximately 66 MHz. The sysCLK 117 is used for 
5 driving other portions of system 100 such as the 
illustrated CPU 110 and triangle -engine 150. 

System 100 further includes a real-time 
audiovisual -data processing subsystem comprised of: a 
basic I/O interface module 105, a general purpose 

10 central -processing unit (CPU) no, a multi-port memory 
130-131, optionally a plurality of 2 -dimensional image 
rendering engines (eel-engines) 148, and the already- 
mentioned 3 -dimensional image rendering engine 
(triangle -engine) 150. 

15 ' nx * real-time audiovisual -data processing 

subsystem may include other image -data manipulating 
engines and audio- data manipulating engines as 
indicated at 149. 

In one embodiment, the memory 130-131 is formed 
20 as two simultaneously and independently addressable 
units, 130 and 131. Data in system memory 130-131 is 
preferably interleaved across a same address space as 
4-KByte pages. Units 130 and 131 preferably utilize 
SDRAM technology (synchronous dynamic random access 
25 memory) . They may also utilize other high-speed random 
access data storage technologies such as video- speed 
static random- access memory subunit (VSRAM) . 

Access to the first memory unit (MEMO) 130 is 
managed by a corresponding first memory access control 
30 unit (MACO) 120. 

Access to the second memory unit (MEMl) 13 1 is 
managed by a respective and independent second memory 
access control unit (MAC1) 121. 
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The MACO unit 120 may exercise exclusive control 
over the address (AO) and control (CO) buses of the 
first memory unit 130 . Any other device which wishes 
to gain access to the AO and CO buses then has to send 
5 a corresponding request to the MACO unit 120 over 
address/control bus 124. The MACO unit 120 arbitrates 
among contending requestors and grants access to the 
highest priority requestor. 

Similarly, the MAC1 unit 121 may exercise 

10 exclusive control over the address (Al) and control 
(Cl) buses of the second memory unit 131. Any other 
device which wishes to gain access to the Al and Cl 
buses then has to send a corresponding request to the 
MAC1 unit 121 over address /control bus 124. The MAC1 

15 unit 121 arbitrates among contending requestors *nri 
grants access to the highest priority requestor. 

The first memory unit 130 has a 32 -bit wide 
bidirectional data bus 122 which is also designated as 
*D0'. The second memory unit 131 has a 32-bit wide 

20 bidirectional data bus 123 which is also designated as 
*D1'. Devices such as CPU 110, VPP 160 and triangle- 
engine 150, read and write data into memory units 130 
and 131 by way of respective data buses DO and Dl. The 
total data bus width is thus 64 bits. 

25 If desired, the illustrated, parallel memory 

architecture may be expanded to include additional 
memory units (MEM's) such as 130-131 interleaved 
address -wise with units 130-131 and respective 
additional memory access control units (MAC's) such as 

30 120-121. The advantage of this parallel memory 
architecture is, of course, that multiple requestors 
can obtain simultaneous access to different parts of a 
centralized memory. It is the responsibility of the 
operating system software (OS) to see to it that 
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contentions for a same memory unit {e.g., 130 or 131) 
are minimized. 

The operating system (OS) software can be stored 
in a nonvolatile storage unit such as ROM (not shown) 
5 or CD-ROM (not shown) that is operatively coupled to, 
or forms part of, the system memory 130-131. For high 
speed nonvolatile storage such as ROM, the OS 
instructions can be executed directly from that 
storage. For slower- speed nonvolatile storage such as 
10 CD-ROM (compact disk), the OS instructions can be 
downloaded into system ram 130-131 and executed from 
there. 

The illustrated embodiment of the image 
processing and display system 100 uses a PowerPC™ 

15 602 CPU, 110 such as made by International Business 
Machines (IBM) of Armonk, New York. Other CPU's may of 
course be used. 

The PowerPC™ CPU no has its own processor data 
bus (PD) 112 and its own address/control bus (PA/C) 

20 114. The PowerPC™ data and address/control buses, 112 
and 114, couple to the memory data and memory address/ - 
control buses, 122, 123 and 124, by way of a bus 
interface unit lis. Signals from peripheral devices 
such as the interactive response mechanism 195 (e.g., 

25 the user- joys tick) couple to the PowerPC™ data and 
address/control buses, 112 and 114, by way of the 
basic -I/O. interface unit 105. An internal cache memory 
($) is provided within the PowerPC™ chip for enabling 
data processing by the CPU 110 at the same time that 

30 other devices access the system memory 130-131. Cache 
coherency is maintained in one embodiment through a 
snooping mechanism. 

CPU no has at least two operating modes, 
referred to here as 'normal mode' and 'privileged 
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supervisory mode' . The operating system (OS) software 
is provided with the exclusive ability to switch the 
CPU 110 from one to the other of the normal and 
privileged supervisory modes. A control signal on the 
5 PowerPC™ address/control bus 114 indicates whether the 
CPU is in normal or privileged supervisory mode. The 
PPC bus interface unit lis copies that signal onto 
memory address /control bus 124 and makes the same 
indication available to the triangle- engine 150 so that 

10 engine 150 can determine whether the CPU is in normal 
or privileged supervisory mode. Software modules that 
are to modify privileged- access registers within the TE 
should do so by way of the OS application program 
interface (API) . in one embodiment, every access to a 

15 TE register that comes by way of the bus interface unit 
115 is automatically deemed to be privileged while 
every access to a TE register that comes by way of the 
memory interface module 151 is automatically deemed to 
be non-privileged. 

20 Instructions and/or image data are loadable into 

the memory units 130-131 from a variety of sources (hot 
shown) , including but not limited to magnetic or 
optical floppy or hard disk drives, a CD-ROM drive, a 
silicon ROM (read-only-memory) device, a cable headend, 

25 a wireless broadcast receiver, a telephone modem, etc. 
Such downloads typically come by way of the basic -I/O 
interface unit 105 . 

The downloaded instructions are not necessarily 
for execution by the CPU no. They may instead be 

30 directed to the triangle-engine 150 as will be seen 
below, or to other instruction executing components 
such as, but not limited to, the VPP 160 and other 
audiovisual data manipulating engines 148-149. 
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Buses 122, 123 and 124 depict in a general sense 
the respective data and control paths for moving 
instructions and image data into and out of memory 
units 130-131. Downloaded image data (including native 
5 and texturing data) can be in compressed or 
decompressed format. Compressed image data may be 
temporarily stored in a compressed image buffer of 
memory 130-131 and expanded into decompressed format 
either in system memory or inside the triangle -engine 
10 on an as needed basis. Immediately- displayable image 
data, such as that of a below-described frame buffer 
(PBO 140) may be maintained as a linked list of non- 
compressed data. 

The CPU 110 sets one or more 'forced- address' 
15 registers within the VPP 160 to point to respective 
locations in the system-memory 130-131 that store the 
start of a currently active 'Video Display List' (VDL) . 
The VDL in turn points to the currently 'active' frame- 
buffer. The VDL has a linked- list structure wherein 
the start block of a VDL can point to a second block or 
back to itself, the second block can point to a third 
block or back to itself, and so forth. A video display 
list (VDL) is deemed exhausted when the video post- 
processor (VPP) 160 has generated a software-defined 
25 number of screen lines. 

After a 'valid' VDL start location is written 
into one of plural 'forced- address' registers in VPP 
160 a DMA controller within the VPP 160 begins to 
periodically send requests to the corresponding MAC 
(120 or 121) far successive blocks of the active Video 
Display List (VDL) that contain image data and render 
control words until the VDL is exhausted. The VPP 160 
stores the fetched control words and processes the 
fetched image data accordingly so as to generate video 
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signals for a corresponding image 185 on the video 
display unit (VDU) 180. 

At the same time that the video post -processor 
(VPP) 160 is repeatedly fetching items from the active 
5 video display list (VDL) and generating display signals 
of the corresponding imagery for consumption by VDU 
180, the CPU 110, the triangle -engine (TE) 150 and/or 
the other image manipulating engines 148-149 can access 
binary-coded data stored within the memory 130-131 and 

10 modify the stored data within memory 130-131 at a 
sufficiently high- rate of speed to create an illusion 
for an observer 190 that real-time animation is 
occurring in a high- resolution image 185 (e.g., 640-by- 
480 pixels, 24 bits-per-pixel) then being displayed on 

15 video display unit 180. 

In many instances, the observer 190 will be 
interacting with the animated image 185 by operating 
buttons or a joystick or other input means of the 
interactive response mechanism 195. The system user's 

20 real-time responses are fed back to the basic- I/O 
interface unit 105 and/or to the CPU 110 and the latter 
units react accordingly in real-time. 

The term 'real-time' as used here means 
sufficiently fast (e.g., within roughly 15 to 0.3 

25 milliseconds) so that a human user 190 perceives the 
corresponding changes to the audiovisual show emanating 
from the audio and visual output units (e.g., VDU 180) 
to be occurring substantially instantaneously. 

The image 185 that is displayed on VDU 180 is 

30 defined in part by bitmapped data stored in one or more 
screen-band buffers {e.g., 140 and 141) within memory 
130-131. Each screen-band buffer contains one or more 
scanlines of bit -mapped image data. Screen-bands can 
be woven together in threaded list style to define a 
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full "screen". Alternatively, a single screen-band (a 
1 simple' panel) can be defined such that the one band 
holds the bit -napped image of an entire screen (e.g., 
a full set of 240 low- resolution lines or 480 high- 
5 resolution lines) . 

When the triangle- engine 150 and VPP 160 are 
being used simultaneously, it is preferable to use 
double- frame buffering (or multi- frame buffering) 
wherein the TE renders into a first frame-buffer and 

10 the VPP reads data from a pre-rendered second frame- 
buffer (from the 'active' FB) . The storage of the 
frame-buffer under- rendition and the active frame- 
buffer is preferably alternated between the MEMO and 
MKMl units, 130 and 131, so that while the VPP 160 is 

15 fetching the data of one frame-buffer (e.g. , FBI) for 
display purposes from one of the MEMO and MEMl units, 
the triangle-engine 150 can be simultaneously writing 
new data to the other of units 130 and 131. 

Multi-buffering is generally employed when 

20 animation is being displayed. The VPP 160 reads image 
data from a first, "active' frame-buffer for current 
display while the triangle- engine 150 writes data into 
a later- to-be displayed, alternate frame-buffer (s ) . 
Then one of the alternate frame-buffers is made the 

25 active buffer and the previous active area is 
relinquished to free memory space or used as the new 
frame -buffer under -rendering. The purpose of this 
alternation scheme is to allow both of the triangle - 
engine 150 and VPP 160 to simultaneously access system 

30 memory without constantly getting in each other's way. 

The VPP 160 should be able to fetch data in 
enough time so as to be able to generate a next line of 
display data when the next horizontal active- scan 
period (H-SCAN) starts. 
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At the same time, the triangle -engine 150 should 
be able to generate next- frame imagery at. sufficiently 
a fast pace so that when frame swapping occurs (when 
the 'under- rendition' FB becomes the currently 'active' 
5 FB) , the system user 190 will not see any image 
'tearing' but will rather perceive smooth 3D object 
rotation or other effects occurring in real-time. 

Referring to Fig. IB, a closer, more detailed 
view of the triangle -engine 150 is shown together with 
10 other system components that interact with the TE 150 
such as system memory 130-131 and CPU 110. 

The triangle- engine 150 is comprised of a memory 
interface module 151 (which includes a vertex-fetch 
unit VTX) , a pre- walk setup module (STP) 152, an edge- 
15 walking module (EWK) 153, a span-walking module (SWK) 
154, a texture mapping module (TXT) 155 (including a 
texture cache) , and a destination blending module (DBL) 
156. 

Modules 152, 153, 154 and 156 operate in 
20 sequential, pipelined fashion as will become apparent. 
The output of the texture mapping module 155 merges 
into the destination blending module 156 roughly in 
coincidence with the output of the span-walking module 
154. 

25 Each of modules 151-156 has a corresponding one 

or more control registers 161-166. The edge-walking 
and span-walking modules 153 and 154 share a common set 
of control registers 163. 

Control register 161-166 may be updated by so- 

30 called 'immediate' instructions or by 'deferred' 
instructions as will be detailed below. Some registers 
within set 161-166 are updatable only by a 'privileged' 
command issuer so as to prevent accidental alteration 
by wayward software modules. 
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The contents of control registers 163 are filled 
by the pre-walk setup module 152 when a 'short -format' 
mode is employed for the deferred instructions. A 
multiplexer is included in the triangle -engine pipeline 
5 for bypassing the pre-walk setup unit 152 when a 'long- 
format' mode is employed for the deferred instructions. 
This programmably- selectable bypass is indicated by 
dashed line 159. When bypass 159 is used, the data 
that would have been output by the pre-walk setup 

10 module 152 should be instead pre- calculated by some 
other means (e.g., CPU no) . This pre- calculated 
output data is loaded into the walkers' control 
registers 163 from a corresponding part of the •long- 
format' deferred instructions. 

15 System memory 130-131 contains a number of data 

structures which interact either directly or indirectly 
with the triangle- engine 150. 

A first, CPU- instructing portion 142 of system 
memory contains instructions for execution by the 

20 system CPU 110. 

A second, deferred- instructions portion 144 of 
system memory contains the so-called 'deferred' 
instructions for execution by the triangle -engine 150. 
A third, under- rendition FB portion 140 of system 
25 memory contains RGB color values for the individual 
pixels of a frame buffer that the triangle- engine 150 
is currently rendering to. 

A fourth, active-FB portion 141 of system memory 
contains RGB color values for the individual pixels of 
a frame buffer that the VPP 160 is currently reading 
from for purposes of display. 

A depth buffer area 146 is set aside in the 
system memory 130-131 for storing data representing the 
l/w FB value of each pixel in the corresponding under- 
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rendition FB 140. There is no separate depth buffer 
area for the currently active frame buffer. 

As shown in Fig. IB, the current 1/w buffer 146 
is preferably stored in a memory unit other than that 
5 which stores the corresponding, currently under- 
rendition FB 140. This is done to minimize contention 
between read/write accesses to the stored 1/w depth 
values for the under- rendition . frame -buffer and the 
read/write accesses to the stored corresponding RGB 

10 values of the frame-buffer under- rendition 140. 

System memory 130-131 further contains one or 
more texturing bit map areas 143, 145 that contain 
texture picture elements (texels) for optional blending 
with span-walker generated color data. A 4 texel' 

15 contains the coloration values {RGB} fc of a 
corresponding depth- independent texture space and it 
may also contain a corresponding blend factor {A} t for 
the texel. The general format of the data stored in a 
texel is thus {R,G,B,A} t . 

20 Due to the assignment of depth- independent 

texture space coordinates to the vertices of polygons 
to be rendered, for each polygon there is a 
correspondence between the area covered by a texel and 
the area of a display pixel. Depending on the 

25 orientation of the polygon in model space relative to 
the image plane, and other factors, this correspondence 
can vary significantly from one part of the polygon to 
another. Therefore, in order to improve rendering 
speed, texels can cover coarse areas of the depth- 

30 independent texture space (285 of Fig. 2D) or finer 
resolution areas. In one embodiment there are four 
levels of texturing resolution (L0D- level of detail) , 
a first that covers the equivalent in x,y image space 
of one frame-buffer pixel at a time; a second level 
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that covers the equivalent in x,y image space of four 
pixels (2x2) at a time; a third level that covers the 
equivalent in x,y image space of sixteen pixels (4x4) 
at a time; and a fourth third level that covers the 
5 equivalent in x,y image space of sixty- four pixels 
(8x8) at a time. Bach coarser texel of the second 
level is derived by averaging the texture information 
{R,G,B,A} t of four covered texels in the finer first 
level. Each texel of the third level is derived by 

10 averaging the texture information {R,G,B,A} fc of four 
covered texels in the finer second level. Each texel 
of the fourth level is derived by averaging the texture 
information {R,G,B,A} t of four covered texels in the 
finer third level. 

15 As the depth factor l/w Ip of a given interior 

point on a triangle surface (e.g., 230 of Pig. 2C) 
decreases from 1.00 to 0 (hither to yonder, see 
Fig. 2A) , the system automatically switches from the 
finer to the coarser levels of texturing. 

20 with regard to the deferred- instructions portion 

144 of system memory, this portion contains 
instruction blocks each having at least one 32 -bit wide 
block header and an optional subsequent set of deferred 
instructions generally formatted as 32 bits per 

25 instruction. The number of instructions in a block 
vary according to an instructions -count field within 
the block header. 

With regard to the texture bit maps 143, 145, 
these portions of system memory may be completely 
omitted if desired or encoded as one bit per texel 
(1BPT) , two bits per texel (2 BPT) , four BPT, six BPT, 
... up to 24 BPT. The 2-24 BPT code of a given texture 
bit map is routed into a texel -parameter look-up table 
(TLDT) which expands each unique coding into 
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corresponding texture information {R,G,B,A} t having 
more bits. One format for {R,G,B,A} t is 8/8/8/7/1 bits 
for respective fields R/G/B/A/D. (The 'D' bit is used 
by the VPP 160 for by-the-pixel dither bypass and/or 
S for other f unctions . ) 

A second, programmably- definable format for 
{R,G,B,A} t is 5/5/5/1/0 bits for respective fields 
R/G/B/A/D. in the second format, the TLUT data is 
packed 2-texels per 32 -bit word in system memory. 
10 A third, programmably- definable format for 

{R,G,B,A} t is 5/5/5/0/1 bits for respective fields 
R/G/B/A/D. The pre -TLUT texel encoding values are 
packed into 32 -bit wide memory words as appropriate for 
their value (e.g., eight 4 -bit wide texel encodes may 
15 be packed into a 32 -bit wide memory word) . 

With regard to the 1/w buffer 146, the respective 
1/wpB values consume 16 bits per frame-buffer pixel 
(16BPP) . This means that two l/w values may be stored 
in each 32 -bit wide memory word. The 16 BPP 
characteristic of the l/w values is a departure from 
the 24 BPP characteristic of conventional Z-buffers. 
Fairly good depth resolution is still provided because 
the 2 16 quantization steps possible for each 16 -bit 
wide l/wpg value are distributed ndnlinearly relative 
25 to the z axis. There are more such quantization steps 
for z close to zero (close to the observer plane) and 
fewer as one zooms out towards z»« (far away from the 
observer plane) . However, in an embodiment, depth 
resolution is further improved by representing the l/w 
values in the buffer 146 using a block- fix format 
described below. 

The frame buffer pixels may be each represented 
as 16 BPP or 24 BPP. When the 16 BPP format is used 
for the FB pixels, a 5/5/5-4/1-2 format is used wherein 
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five of the bits define a R field, another five of the 
bits define a G field, four or five further bits define 
a B field and the remaining two or one bits define one 
or more of a so-called, D-bit, a vertical -subposition 
bit and a horizontal subposition. The latter D and 
subposition bits may be used by the VPP 160 for 
functions operating on a by- the pixel basis. 

The coordinated operations of triangle- engine 
150, of the CPU 110, and of the memory regions 140-146 
are based on three-dimensional framework modeling such 
as shown in Fig. 2A. 

Referring to Pig. 2A, a perspective view is shown 
of a three-dimensional coordinate system 200 having an 
x-axis extending horizontally, a y-axis extending 
vertically and a z-axis extending generally into the 
plane of the figure. The three-dimensional space 
defined by these three coordinates is referred to 
herein as "model space", in the present embodiment, all 
displayable objects of the model to be rendered are 
20 placed behind the z=0 plane. The system user 190 is 
understood to be situated at the origin facing toward 
positive z half -space, and seeing a 2 -dimensional image 
projected onto a plane defined as z=d (d>0) . The z=d 
plane is referred to herein as the "pro j ection plane" 
or the ■ image plane", and its x and y axes are said to 
define a two-dimensional image space. In the present 
embodiment, the image plane is always in the positive 
z half of model space, and the x and y axes of image 
space are considered to be parallel with the x and y 
axes of model space. In another embodiment, another 
image plane can be assumed (whether or not 
perpendicular to the z-axis) , and/or a different set of 
axes can be chosen for image space (whether or not 
parallel to the x and y axes of model space). (»Co- 
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incident" axes are considered herein to be parallel to 
each other, coincidence being considered merely a 
special case of parallelism) . 

In the illustrated example, the three-dimensional 
object to be displayed is a spherical globe 210 having 
an outer surface 220. A pattern of sampling points are 
distributed about the three-dimensional object surface 
220. 

In the illustrated example, a regular pattern of 
vertex points such as 1, 2, and 3 are distributed at 
the intersections of regularly dispersed longitudes and 
latitudes . 

A surface -approximating planar triangle is 
hypothetically drawn for every set of three neighboring 
15 sample points. In the example of Fig. 2A, sample 
points 2, 2, 3 define the vertex points of hypothetical 
triangle 230. 

For the present embodiment, a linear transform 
between coordinate value z and a coordinate value w is 
defined as w=z/h, where h represents the point on the 
z axis through which the hither plane passes. This 
means that as z varies from a hither point of z=h to a 
yonder point near z=«, the reciprocal value l/w 
correspondingly varies from l/w=l (hither) to nearly 
25 l/w=o (yonder) . Note that if the image plane is chosen 
differently, then w can bear a different relationship 
to z. 

Assume that the range l/w= 0 to l/w= 1.00 is 
subdivided into the 2*6 equal parts. More of those 
parts will map to those portions of the z-axis that are 
closer to the hither region than those that map to 
portions of the z-axis closer to yonder. Finer depth 
resolution is therefore provided for those points that 
are closer to the viewer (in front of the z=0 plane) 
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than for those points that are further away from the 
viewer. This concept can be otherwise understood by 
noting that the derivative of l/w increases rapidly as 
z approaches infinity and decreases rapidly as z 
5 approaches zero. 

In more generic terms, for a linear transform, 
w= ojz+P, the reciprocal is i/w= l/(az+P) . The 
reciprocal l/w approaches zero as z approaches 
infinity. The reciprocal l/w approaches the value 1/p 
10 as z approaches zero. The derivative, dfw^J/dz equals 
-a/(az+p) 2 and approaches zero from the negative side 
as z approaches infinity, meaning that d(w _1 )/dz gets 
larger as z approaches infinity. 

In terms of physical digital systems, this means 
15 that one may store a 16 -bit wide value representing l/w 
in a 16 -bit memory location and obtain relatively good 
resolution for points near the viewer while suffering 
poorer depth resolution as one moves further away from 
the viewer along the z direction. 
20 This type of variation in depth resolution meshes 

well with human vision in terms of things near and far. 
Human beings tend to have better depth perception for 
objects close by than for those far away. A form of 
visual realism is therefor inherently created by using 
25 a digital word of finite bit width for storing the 
depth factor l/w. its resolving ability inherently 
mimics the ability of the human vision system to see 
depth perception at better resolution close up than far 
away. Block- fix representation improves resolution 
further at greater depths, as discussed below. 

What the human observer 190 sees depends on the 
optical properties of surface 220 and on surrounding 
lighting. The outer surface 220 of globe 210 may have 
any different kinds of optical properties. Surface 220 
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may be transparent or opaque or semi-transparent. It 
may be reflective or nonreflective. It may be formed 
of various combinations of transparent, opaque, semi- 
transparent, reflective portions distributed 
5 thereabout . 

A beach ball made from an almost -transparent 
shell material would be one example of a physical item 
represented by globe 210. (A completely transparent 
shell would be invisible.) A similar beach ball having 
10 the continents of the world imprinted on its outer 
surface as colored opaque regions but not the oceans 
would be another example, in the latter case one could 
see the backside of the globe by looking through the 
almost -transparent ocean areas. A classroom globe made 
with an opaque shell and having both the continents and 
the oceans of the world painted thereon using different 
opaque colors would be yet another example. 

Fig. 2B shows an example of a globe 210' whose 
outer surface 220' is painted with a multi-colored 
opaque pattern 250 representing the land masses of the 
Western hemisphere. Another portion 260 of surface 
220' is made semi-transparent and tinted with various 
shades of blue to indicate shallow and deep ocean 
areas. Uniform lighting is assumed. 

Assume that surface -approximating triangle 230 
(Pig. 2A) forms part of the semi-transparent ocean area 
260 (Fig. 2B) . 

Fig. 2C is a perspective view for showing how 
coloration and other parameters may be defined at 
30 interior points (IP) of a triangle such as 230. 
Triangle 230 is first defined as nothing more than a 
set of three vertex points Vx,, Vxj and VXj suspended in 
the three-dimensional model space of z>0. 
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A respective set of predefined parameters or 
attribute values are attached to each of the vertex 
points Vx,, V*2, Vxj. The attached parameter set of Vx, 
is shown as {x,, y v l/w v R 1# G ,, B,, A^ (u/w,) , (v/w,)}. 
5 Although not shown, it is understood that second 

vertex VXj has a correspondingly subscripted set of 
parameters defining 3 -dimensional location, coloration, 
blend and texture mapping; and that third vertex Vx 3 
also has a correspondingly subscripted set of 
10 parameters. Digital signals are stored in memory in 
the form of deferred instructions (e.g., in region 144 
of Pig . IB) for representing these predefined 
parameters . 

At this point in time there is no stored data in 
15 system memory for representing the x, y, l/w, R # G, B, 
A, u/w # v/w parameters of interior points of triangle 
230. The stored definition of triangle 230 by means of 
only the three vertex points, Vx,, Vxg and Vx^ is fairly 
compact and thus system memory space is conserved. 
20 The triangle- engine 150 fetches the so-called 

deferred- list containing the predefined parameters of 
vertex points Vx x , Vx 2 and Vx 3 by sending a 
corresponding request to MAC1 unit 121 (Pig. IB) . 
Memory and bus usage time is minimized by transmitting 
25 only this minimal information set to the triangle- 
engine . 

Upon obtaining and locally storing the 
information for vertex points Vx 1# VX2, VX3, the 
triangle -engine 150 next projects the vertex points 
30 onto the z=0 plane and determines which vertex point or 
points is highest and which point or points are lowest. 
One edge of the triangle having both a highest and a 
lowest vertex is designated as the 'long- walk' edge. 
Another edge of the triangle that shares the highest 
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vertex is designated as the 'short -walk' edge. This 
step of distinguishing vertices and edges is referred 
to as vertex sorting. 

The triangle- engine 150 then computes the slopes 
5 and lengths of the short/long walk edges 212 and 231. 

The respective parameters of each interior point 
(IP) having coordinates x jf yj and lying on the surface 
of triangle 230 are then determined by bi-linear 
interpolation . 

10 Bi- linear interpolation works as follows. Assume 

that each parameter (x, y, l/w, R, g, B, A, u/w, v/w) 
of each interior point IP on the surface of triangle 
230 is a linear function of its x and y coordinates. 
Using the color parameter R as an example, we can then 
15 write the linear equation {Eq. 1.0} as follows: 

Rj(x-j,yj) = ^x + b^y + {Eg. 1.0} 

We do not • yet . know the values of the equation 
constants, a, b and c. These can be solved for by 
noting that the same equation holds true at the three 
20 known vertex points Vx lt Vx 2 and Vx 3 having respective 
coordinates (x^y^ , (x^y^, and (x 3 ,y 3 ). 

Thus, the following three additional equations 
may be written: 

Ri(x 1/yi ) = a^ + b Ryi + c R { E q. 1.1} 

R2<*2,y2> = + b R y 2 + c R { E q. 1.2} 

R3(x 3 ,y 3 > - ^3 + b R y 3 -.- cjj {Eq. 1.3} 

The constants a R , bg, and c R can now be solved using 
well known. linear algebra techniques. This solution is 
carried out by the hardware of the pre -walk setup 
module 152 on a per triangle basis. 

Note that the values a R , b R and c R change with 
every new triangle. Accordingly, a new solution for 
a R , b R and c R is generally needed with the fetch of 
each new triangle. This is one of the jobs of the pre- 
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walk setup module 152 (Fig. IB) . The pre-walk setup 
module 152 can be computing the hi -linear interpolation 
constants a R , and for a second triangle while the 
edge-walker 153 and span-walker 154 are stepping across 

5 an earlier processed triangle. 

Not only are the linear equation constants a, b, 
and c different for each triangle, they are different 
for each parameter. Accordingly, the pre-walk setup 
module 152 computes separate equation constants for 

0 each of the parameters and then stores them in the 
walker's control registers 163 in accordance with the 
following TABLE IV: 



TABLE IV 



15 



R 


G 


B 


A 


u/w 


v/w 


l/w | 


a R 




a B 


a ft 


a U 


a Y 


a W 1 




be 


b B 


b A 




bv 


bw 1 



Note that linear equation constants, c R through 
c w , are not stored in the walker's control registers 

20 163. There is no need for them. The parameters of 
vertex points Vx x , Vx 2 . Vx 3 inherently incorporate the 
equation constants, c, in their respective parameter 
values. The edge and span walkers 153-154 start at the 
vertices and then increment from there by respective 

25 deltas. 

Note that the partial derivative of R with 
respect to x is equal to a^ The partial derivative of 
R with respect to y is equal to b R . The linear 
equation constant c R drops out during differentiation. 
30 Referring to Fig. 2D, it is sometimes desirable 

to blend in a texturing image with the native 
coloration of the framework triangles. For example, 
assume that a set of semi-transparent storm clouds are 
to be drawn lying over the ocean at region 270 in model 
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space. The rendering of such additional information is 
referred to as texturing. 

The textured image may be made to appear as if it 
wraps 3-dimensionally about the three-dimensional 
5 surface of the rendered native object. Pig. 2D shows 
an example wherein a regular five-pointed star 280 is 
projected onto the surface of globe 210" with rotation 
and perspective adjustment to thereby produce the 
3-dimensionally wrapped star 270 near the horizon of 
10 globe 210". 

The regularly- formed star 280 is defined in a 
two-dimensional space 285 having Cartesian coordinates 
u and v. Each i'th texture point (TP) in space 285 has 
associated therewith a respective set of coordinates, 

15 u,, v. and also respective coloration and blend 
information, (R,, B M G |f A,) t . No depth information is 
associated with texture points in depth- independent 
texture space 285, although it might in an embodiment 
that supports three-dimensional texture maps. 

20 The texture coloration and blend factors {R, G, 

B, A} t at a given texture point TP(u,v) is obtained by 
first converting the values u/w and v/w of a 
corresponding triangle surface point (IP) to respective 
u and v coordinates by dividing by l/w. As the value 

25 of l/w decreases towards zero, the steps taken in 
texture space 285 become larger and larger. The effect 
of these increasingly larger steps in texture space is 
to produce a more compressed projection onto the x-y 
plane during rendering. 

30 Such a compressive effect is seen in Fig. 2D 

where a head portion of star 280 generates a relatively 
large right arm of projected star 270 while the legs of 
original star 280 become compressed into the smaller 
respective left leg and left arm of projection 270. 
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The smaller left leg and arm portions of projection 270 
are closer to the globe horizon, and the head portion 
of projection 270 is closer to the equatorial center, 
the latter being the 3D point closest to viewer 190. 
5 The same star 280 in texture space 285 may be 

mapped multiple times onto surface 220". Thus, storm 
system 272 may be a second rendition of star 280 with 
its own unique set of rotations and depth values l/w. 
Any one change made in texture space 285 will be 

10 reflected to the multiple points on surface 220" that 
reference the changed point in texture space. Thus, if 
the head of star 280 is originally positioned at 
texture point TP(ui,vi), and the image of the star 280 
is moved upward in texture space to new position 280 ' , 

15 all native points on surface 220" that reference the 
original position TP(ui,Vi) will cease to show the head 
point of the star and will now show whatever new 
texture value {Ri,G i ,B i ,A i } T is moved into position 
TP(ui,vi). All native points on surface 220" that 

20 point to the new position TPtujcVfc) of the star tip 
will now pick up the newly- introduced texture value 
{ R k/ G k' B k' A k)T f or blending with their native colors. 

If desired, the two-dimensional cell engines 148 
may be used for making real-time modifications to the 

25 {R,G,B} T values of texture space 285. 

Referring to Pig. 3, the image plane is shown 
mapped onto a frame-buffer space composed of frame- 
buffer rows and columns. The long- walk (LW) and short - 
walk (SW) edges from vertex Vxx are also projected onto 

30 the image plane. 

Bach intersection of a row and column in the 
figure corresponds to a separate storage element of the 
frame buffer, and these storage elements sometimes are 
referred to herein as "image cells". Thus each image 



WO 96/36011 PCT/US96/06653 



-55- 

cell in the frame buffer corresponds to a respective 
"image region" in the image plane. The different image 
regions in the general case do not need to be defined 
in a grid as shown in Pig. 3, but it is convenient if 
5 they are. Also, because the attribute values 
eventually written into the image cells of a frame 
buffer represent collectively a displayable image, each 
such image cell is sometimes referred to herein as a 
"pixel". It should be noted, however, that due to 

10 subsequent image modification, and/or video post- 
processing activities such as optional image resolution 
enhancement or reduction, the individual image cells 
may not correspond exactly to individual pixels as 
ultimately seen on the display 185. ' Finally, note that 

15 as a result of the projection, each image region 
corresponds to a different region of the polygon 
surface in model space. The region of the polygon 
surface in model space which is projected onto a single 
image region in image space is referred to herein as a 

20 "surface region", m general, surface regions vary in 
size depending on perspective, and are not necessarily 
rectangular even if the corresponding image region is 
rectangular. 

In Fig. 3 it is assumed that the left edge of 
25 triangle 230 is the long- walk (LW) edge and the right 
edge is the short -walk (SW) edge. Edge walker 153 
begins at point Vx 1 in the x-y plane and begins to walk 
down the long-walk edge 212. The edge walker 153 stops 
at each point along edge 212 having a y coordinate 
30 corresponding to the dead center of a crossed- over 
frame-buffer pixel. The edge walker 153 then compares 
the x position of the long-walk edge point with the x 
position of the dead center point of the underlying 
frame -buffer pixel. If the dead center of the 
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10 



underlying frame-buffer pixel is to the right of the 
long-walk line 212, (and to the left of the short-walk 
line 231), then the edge walker 153 designates the 
underlying, frame-buff er pixel as being ' inside' the 
area of the walked triangle. 

If the edge -overlapped frame-buffer pixel is 
'inside' the triangle, the edge walker calculates the 
interior point parameters {x,y,R,G,B,A,u/w,v/w, l/v} ± of 
the dead center point of the first 'inside' frame-buffer 
pixel and hands the calculated parameter values over to 
the span-walker 154 by way of walkers control register 
set 163. 

The edge-walker 153 also calculates for the span- 
walker 154, the location (x and y only) of the last 

15 frame-buffer pixel that is 'inside' the triangle. The 
span-walker 154 then walks along the x-direction by 
increments equal to the center- to- center pitch of the 
frame-buffer pixels, and at each stop 'inside' the long 
and short -walk edges 212 and 231, the span- walking 

20 module 154 calculates the corresponding, iterated 
parameters, {x,y, R,G,B,A, u/w,v/w,i/w} f . Note that 
whereas the present embodiment performs these 
interpolations iteratively, other embodiments can 
perform each interpolation in closed form at each point 

25 in the triangle. 

As the span walker 154 steps center- to- center, 
the walker 154 also calculates the texture space 
mapping coordinates, u ± and v ± , from (the significands 
only of) the iteratively derived values of Hi, li and 
■i. The coordinates (h, Hi) are coordinates "into"* what 
is referred to herein as a "depth-adjusted texture 
space", because the coordinates of such space have been 
adjusted for depth. Iterative calculation of u, and v { 
is much easier using depth- adjusted coordinates rather 
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than the depth- independent coordinates (u,v) . 
Therefore, the texture space coordinates assigned to 
each triangle vertex as provided to the vertex unit 151 
is already adjusted for depth, and the iterative 
calculations are performed in the depth-adjusted 
coordinate system. The span walker module 154 converts 
the interpolated significands of depth- adjusted 
coordinates (J£, J£) for each point into depth- 
independent form (u, and v,) by dividing by the 
significand of (-L), before handing them off to the 
texture mapping module 155. The iterated coloration, 
blend and depth values {R, g, b. A, i/w}, are handed off 
to the blending unit 156 as shown in Pig. IB. 

While the span walker 154 is stepping across in 
15 the x direction for a first frame buffer row, the edge 
walker 153 proceeds to perform the ' inside' versus 
'outside' and other calculations for the next frame 
buffer row. The pre-walk setup module 152 is at the 
same time performing the set-up calculations for the 
next triangle and the vertex fetch unit 151 is at the 
same time fetching at least one new vertex Vxnew for 
next -processing by the pre-walk setup module 152. Thus 
the system is highly pipelined for efficient rendering 
of vector-based surface information onto a two- 
25 dimensional plane. 

In Fig. 3, those frame-buffer pixels that are 
designated as being 'inside' the triangle are marked by 
interior black squares at their dead center positions. 
Frame-buffer pixels that are deemed to be 'outside' the 
30 walked triangle are indicated as such by unfilled 
circles at their respective dead center positions. If 
the dead center point of a given frame -buffer pixel 
coincides exactly with the walked edge, 212 or 231, an 
arbitration scheme is carried out to designate that 
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frame-buffer pixel as being » inside' or 'outside' 
without allowing for double strikes or gaps between the 
triangles that share that frame -buffer pixel on exactly 
50%-50% basis. 

5 Referring to Fig. 4, the task allocation between 

the system CPU and the triangle engine is arranged to 
sake balanced, efficient use of both resources, no and 
150. 

As shown in portion 410 of the pipeline flow 

10 diagram 400 of Pig. 4 , . the CPU 110 manages the 
importation of data representing three-dimensional 
object frameworks from an image source such as a CD-ROM 
or the like into system memory. Alternatively, the CPU 
builds the data for the three-dimensional model space 

15 object frameworks on its own. 

Next, the system CPU HO accesses the memory 
location where the imported object definitions are 
stored and transforms them three-dimensionally as 
desired to scale the size of the object, to rotate it, 

20 to translate it, to distort it, etc., and to project it 
onto the two-dimensional image plane. Typically, these 
transformations are performed using 4x4 matrix 
• multiplications and homogeneous coordinates , naturally 
yielding a homogeneity value w or l/w for each polygon 

25 vertex. 

The CPU no is also responsible for clipping away 
from the framework any vertex points which fall outside 
of the display scene. 

The CPU no is further responsible for defining 
30 various lighting conditions. This can be done by 
modifying the native color values of each vertex point 
{R,G,B} Vx of each surface native point or changing its 
blend factor {a} Vx . This may also be done by 
appropriate mapping of texture values to the sample 
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points (vertices) of the 3D object surface 220 and 
varying the blend factor {a} t of the texturing 
coloration {R,G,B} T . 

The CPU no may also be given the task of 
locating surfaces that are totally covered by other 
opaquely- colored surfaces and of removing the vertex- 
based frameworks of those hidden surfaces to lessen the 
workload of the triangle engine. 

The CPU no then constructs vertex- based draw 
lists in the form of the deferred instruction lists 
which are stored in region 144 of system memory. 

Once a sufficiently- long list of deferred 
instructions has been developed for processing by the 
triangle engine 150, the CPU 110 sends an 'immediate' 
15 flow- control instruction to the triangle engine to set 
the triangle to work on the constructed list of 
deferred instructions area 144 . The pipelined 
processing of information then passes to the triangle 
engine as indicated by flow line 415. 

On occasion, the triangle engine 150 may run into 
an exception situation which requires intervention by 
the CPU 110. Such request for intervention is 
indicated by dashed flow line 455. The CPU responds 
accordingly with appropriate 'immediate' instructions. 

Referring to box 450 of Fig. 4, once the triangle 
engine is set to work on a given worklist (see Pig. 7) , 
the triangle engine 150 begins to periodically fetch 
from memory region 144 successive 'valid' portions of 
the worklist. 

30 The worklist generally starts the triangle engine 

with an initial task of building a so-called 'new' 
triangle having three newly-defined vertices. Strip 
and fan formations may be developed from the 'new' 
triangle by replacing one of the previous three 
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vertices with a new vertex and then confuting the 
parameters of the new triangle that includes the 
substituted vertex. 

With each set of three vertices, the pre-walk 
5 setup module 155 sorts the vertices as explained above. 
The symbol "(*)" in Fig. 4 indicates that this function 
may be bypassed through, bypass path 159 and performed 
instead by the CPU or another means. 

As box 450 further shows, the pre-walk setup 
10 module 152 of the triangle engine further computes 
various gradients for the walkers along the triangle 
edges and from pixel -center to pixel -center of each 
frame buffer row. 

The edge walker 153 walks along the triangle 

15 edges. The span walker 154 walks across frame buffer 
rows. As span walker 154 stops at the dead- center of 
each respective frame buffer pixel, the walker 154 
computes the iterated values u ± , v ± and sends 
corresponding signals to texture mapper 155. 

20 Texture mapper 155 fetches the requested texture 

data from its local cache. System software is 
responsible for seeing to it that the appropriate 
texture data is pre-loaded into the texture cache from 
a corresponding one of maps 143 and 145 in system 

25 memory. (Each triangle that is mapped onto the z=0 
plane (the x-y plane) has a counterpart triangle mapped 
onto texture space 285 (onto the u-v plane) . System 
software can predict from known vertex coordinates what 
portion of the texture space 285 will be covered while 

30 a given triangle is spanned in x-y space, and the 
system software can thus pre-load that data into the 
texture cache.) Note that in another embodiment, 
texture maps can be defined procedurally instead of 
with (or in combination with) look-up tables. 
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The texture mapper 155 includes the TLDT (texture 
lookup table) which converts the encoded texel data 
into full texture information {r,g,B,a} t . 

The blender module 156 blends the respective, 
5 various parameter signals as output {R,G,B,A, 1/w} ± by 
the span walker 154; as output {R,G,B,A} T by the 
texture mapper 155; and as originally found 
{R^B^A^fb in the frame buffer under -rendition 140 
and 1/w buffer 146; and blends these parameter signals 
10 together to produce a new frame buffer coloration 
signal, FB'=f ( Ci , Ai , Or, A T , C D ) which is a function 
of the input signals, where c A represents {R, G, B} ± , 
Or represents {R, G, B} T and C D represents the original 
frame-buffer coloration information {R, G, Bjps before 
15 the new value is written into the destination. The 
calculation can also depend on a blend factor A, 
associated with the original frame buffer data. 

After blending, the iterated depth factor 1/ Wi is 
tested against the original frame buffer value l/ WFB 
20 stored in buffer 146. if the test result proves true, 
the new coloration is written into the corresponding 
frame buffer location and the new i/ Wi value, if 
prevailing, is written into the corresponding location 
in depth buffer 146. If the test result is false, then 
25 no writing takes place. The depth test function is 
programmably selectable among the relationships: less 
than (<} , less than or equal to (<) , equal to (=} , 
greater than or equal to (>) , greater than (>) and not 
equal (ft), as between l/ Wi and l/ WFB . The depth test 
function can also be selected to always or never write 
the new coloration or 1/w, value. The desired test type 
is written by CPU no into the blend control registers 
166. In actuality, the CPU 110 selects among the above 
test types separately for the question of whether to 
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write coloration data and whether to write depth data- 
in another embodiment, the test type is predefined as 
one of the above relationships, is not changeable, and 
the same test applies to both questions. 
5 Referring to Fig, 5, time graph 500 illustrates 

the overlapped pipeline operations of the CPU 110 and 
the triangle engine 150 in respective time slots 1, 2, 
3, etc. Time -bounded operations of the CPU are 
respectively represented by boxes 511, 512, 513, 514, 
10 515 for respective time slots 1-5. Time-bounded 
operations of the triangle engine are similarly 
represented by boxes 553, 554, 555 for respective time 
slots 3-5. 

The creation of a deferred instruction list is 
15 represented by diagonal flow lines coming off of the 
CPU boxes such as lines 541, 542, 543, 544 and 545. 
Immediate instructions sent from the CPU to the TE are 
represented by vertical downlines such as 503 and 505. 
Before the triangle engine 150 begins operations 
20 on a given worklist, the CPU may spend any number of 
time slots to generate the deferred instructions of 
that worklist as indicated by work flowlines 541 and 
542. - 

At time slot 3 of the illustrated example, the 
25 CPU sends an immediate instruction 503 to the triangle 
engine to begin consuming a worklist having a pre- 
defined set of deferred instructions. The triangle 
engine begins to do so and at the same time the CPU may 
continue to generate more deferred instructions as 
30 indicated by work flowline 543. These newer 
instructions 543 may be picked up by the triangle 
engine 554 in time slot 4. Similarly in time slot 5, 
the CPU may send additional immediate instructions 505 
to the triangle engine in combination with previously- 
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triangle engine may thus be a combination of both 
immediate and deferred instructions. The input to the 
triangle- engine ISO is not limited by the output 
5 bandwidth of the CPU 110. 

Pig. 6 shows a data structure that may be used in 
system memory region 144 to define a block of 
'instructions' 610, 620, 630, 640, 650, etc. The 
listed instructions may be used for causing the 

10 triangle engine 150 to render one or more triangles, 
one or more » points' or one or more other structures 
into the under- rendition FB 140. 

The specific format of the instructions block is 
defined by a block header 600. The block header 600 

15 includes a first field indicating how many instructions 
are included within its corresponding block. The block 
header 600 further includes a second field indicating 
the format of each instruction within its corresponding 
block. The instructions count and format may change on 

20 a block by block basis. 

In the example of Pig. 6, it is assumed that the 
block header 600 has specified a collection of short - 
format vertex instructions for its corresponding block. 
The data structure for defining a first short -format 

25 vertex instruction is shown at 610. The size of 
instruction 610 will vary depending on which master 
functions, are turned on or off within the triangle 
engine and which local functions are turned on or off 
within the block header 600. 

30 Location- specifying words 611 for the image plane 

x and y coordinates are always present in a vertex 
instruction and are each 32 bits wide. Shading words 
612 for RGBA are present only if shading is turned on. 
The reciprocal Z -buffering word 613 is present either 
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if depth testing and. buffering is turned on, or if 
texturing is turned on. (The texturing process needs 
l/w in order to confute u and v from u/w and v/w. ) The 
texture mapping words 614 for u/w and v/w are present 
5 only if texturing is turned on. Control fields within 
the block header 600 indicate whether for the local 
block, shading is turned on, Z-buffering is turned on 
and/ or texturing is turned on. 

The remaining instructions 620, 630, etc., under 

10 block header 600 all have the same size and format as 
first instruction 610. 

Vertex instruction 620 contains the information 
for vertex Vx 2 . Vertex instruction 630 contains the 
information for vertex VX3. 

15 Vertex instruction 640 contains the information 

for a next vertex, in this example, Vx new . The new 
vertex Vx new defines a substitute for one of previous 
vertices defined as oldest and medium. Replacement of 
oldest or medium is selected depending on whether strip 

20 or fan formation is turned on within the block header 
600. When strip or fan formation is turned on, next 
triangles are specified with the addition of just one 
new vertex. Accordingly, a chain of many triangles 
each joined to the other by a common edge may be 

25 specified in a fairly compact manner that after the 
overhead of the block header and the first triangle in 
the chain, requires no more than one vertex instruction 
per triangle. 

The block header for vertex instructions is 32- 

30 bits wide and has the format as set forth in TABLE V: 
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TABLB V 



10 



15 



1 Field 


No. of 
Bits 


Description | 


ID 


4 


Instruction identification code => vertex 1 
instruction. | 


VBR 




Hardware version number needed by the 1 
hardware platform to support this 8 
instruction block. | 


CUT 


16 


Instructions count. Indicates how many 8 
vertex instructions follow this header (1- 
65,536). 


SHADE 


1 


0 s no shading information present in below 
vertex instructions, 1 = shading information 
is present. 


PRSP 


1 


■1" = 1/w information is present in below 
groups, "0" = no perspective information is 
present. 


TEXT 


1 


"1* s texture 4 nfnnwa ♦* -i rm A a nrooanf ^-n 

below vertex instructions; "0" s no texture 
information is present. 


HEW 


1 


"1" » the first three vertices define a new 
triangle for starting a strip or fan ^>v»*n 
or for defining the triangle by itself. "0 B 
= this is a continuation of a previous strip 
or fan chain. (This field is used only for 
short -format vertex instructions. When 
active, the new field causes all TE 
processing to halt until three new vertices 
have been in- loaded. 


RM 


1 


Replace mode. Vxnew replaces the following: 
■0" = replace oldest (strip generation) , "l" 
= replace medium (fan generation) . 




3 


Reserved. 



One encoding of the instruction ID field 
indicates that short -format vertex instructions will be 

20 used such as shown in Fig. 6. Another encoding 
indicates that a so-called long- format will be used. 
The long format instruction activates bypass 159 so 
that the operations of the pre-walk setup module 152 
are bypassed and the required information from that 

25 bypassed unit is instead provided by another source. 
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The information in a long- format vertex instruction i 
as follows: 





TABLE VI 


5 


Information 


Foimat 


Outputs (Vertex) 
Inputs (Edge) 




X- Coordinates 


12.0 


XI, X2 




Y- Coordinates 


12.0 


Y1,Y2,Y3 




Spatial Slopes 


S12.0 


xstep_long 
xstep_0 
xstep"~l 


10 


Characteristic 
Function 
Gradients 


sl2 0 


xystep_long 
xystep_0 
xystep 1 




Characteristic 
Function 
Slopes 


S12.0 


dy long 
dy_0 

dy""i 




Colors 


9.0 


R1,G1,B1 




Color Slopes 


89.11 


slope_r 

sloped 
slope b 
dDx_r 
dDxjg 
dDx b 




Blend Factor 


9.0 


Al 




Blend Factor Slopes 


S9.ll 


slope_a 


20 


Textures 
(u/w, v/w) 


11.13 


uWl,vWl 




Depth 
(1/w) 


1.23 


Wl 




Texture Slopes 


Sll.13 


slope_uW 
slope vw 
dDx_uW 
dDx~vW 




Depth Slopes 


61.23 


slope W 
dDx W 


25 


Scan Direction (Area) 


523.0 


R2L 



In a third encoding of the vertex instruction ID 
field, each vertex instruction may be used for coloring 
a single frame-buffer pixel rather than an area covered 
by a triangle, in such a case each vertex instruction 
has the following format: 
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Information 


Format 


Outputs (Vertex) 
Inputs (Bdcre) 


X- Coordinate 


12.0 


X 


Y - Coordinate 


12.0 


Y 


Colors 


9.0 


R1,G1,B1 


Depth Factor 


9.0 


Al 


Texture Depth 


1.23 


1/Wl 



10 



15 



20 



25 



30 



35 



Referring to Fig. 7, the triangle engine 150 
includes a next- read pointer register 710 and a next- 
write pointer register 720. 

The contents of the next -read pointer register 
710 point to the location 715 in system memory 
containing the instruction to be now executed by the 
triangle engine 150. After the instruction fetch unit 
151 has fetched this instruction, it normally 
increments the next -read pointer register 710 to the 
next sequential instruction in memory, unless otherwise 
commanded. 

One way that the instruction fetch unit 151 can 
be otherwise commanded is if the most recent 
instruction is a flow- control instruction telling the 
fetch unit 151 to instead add a relative jump value to 
the value currently in the next -read pointer register 
710. A relative jump around skipped instructions is 
indicated for example at 712. 

Another way that the normal fetch sequence of 
instructions is altered is when the CPU supplies an 
immediate command for changing instruction flow. 

The next-write pointer register 720 points to a 
location 721 in the deferred instructions area 144 of 
system memory where a next instruction may be inserted 
and thus appended onto the currently active worklist. 
(Region 144 can store more than one worklist.) 
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The next -read pointer 710 sweeps down from the 
top of the active TB worklist (from initial position 
711) towards the position 721 of the next-write pointer 
720. If the next -read pointer 710 advances so quickly 
5 that it bumps into the location 721 pointed to by the 
next-write pointer 720, the triangle engine 150 takes 
this to mean that it has exhausted its worklist. The 
triangle-engine then stops and sends an interrupt to 
the CPU 110. 

10 An already-done portion (DONE) of the worklist is 

defined by the sweep of the next -read pointer from 
initial position 711 to current position 715. The 
still to-be-done portion (TO DO) of the instruction 
list continues from beyond pointer 715 to just before 

15 the position pointed to by pointer 721. The under- 
construction portion of the list is defined as 
beginning at the location 721 pointed to by pointer 
register 720 and continuing down to the end of the list 
area. 

20 The next-read pointer register 710 and next-write 

pointer register 720 are part of the fetch control 
register set 161 shown in Pig. ib. A description will 
now be provided of these and other triangle engine 
control registers. The following convention is used to 

25 describe the accessibility of registers within the 
triangle engine 150: 
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TABLE VIII 



5 



Mode 


Description 


R 


The register/bit can be read as part of the address 
space of system memory. 


W 


The recxistsr/faifc ra« ho w^^^a** «- ^tM^i.. — — . 

2i ' oe written to directly as part 
of the address space of system memory. 


5 


The register/bit supports a bit -masked set 
operation. 


C 


The register /bit supports a bit -masked clear 
operation. 


§ 


The register/bit may only be accessed (read/write) 
by a privileged supervisor (e.g. by a command 
signal coming by way of the PPC bus interface 115 
f ran the CPU only and not by a deferred instruction 
fetched by way of the memory interface unit 151) 


HA 


Register/bits not available. 



Set and clear masks are provided so that 
individual bits may be set or reset without changing 
other fields within a given register. 

Read/write access to certain registers is 

15 privileged as indicated by the symbol. The contents 
of a § privileged register or field can only be read or 
modified by a privileged command signal coining from a 
privileged unit such as the CPU (e.g. and optionally 
further constrained to the condition of when the CPU is 

20 in privileged supervisor mode) and not by a non- 
privileged access request such as might come from a 
deferred instructions list by way of memory interface 
module 151. if a non-privileged access attempt is made 
to a privileged register, an access violation interrupt 

25 is automatically generated by the TE 150. 

The following first set of registers defines a 
so-called master control set which applies to the basic 
function blocks of the triangle engine: 
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TABLE IX 



S 



10 



Register Name 


Field 
Name 


Bit 

8 


Acces 
s 


Description 


TB_MaaterMode 




32 


RWSC 






wuaye 


a 


RWSC 


Disable shading 




DZBuff 


1 


RWSC 


Disable Z buffering 




DText 


1 


RWSC 


Disable texturing 




DBlend 


1 


§ 

RWSC 


Disable destination 
blender 




DDith 


1 


§ 

RWSC 


Disable dithering 




RSET 


1 


RWSC 


Reset the entire 
Triangle Engine 






26 


N/A 


reserved 



Note that each field of the TE MasterMode 
register is modifiable only by a privileged command 
15 giver (e.g., the CPU) , meaning that wayward software or 
a bad deferred instructions list is unlikely to modify 
these master registers. 

When shading is disabled, the pre-walk setup 
module 152 bypasses the step of solving for the linear 
20 equation constants for the shading values • 

When texturing is disabled, the texture mapper 
155 bypasses the step of fetching texture data from 
memory. 

When destination blending is disabled, the 
25 blender module 156 bypasses the step of fetching the 
old values from the frame buffer under rendition, and 
also bypasses most other destination blend functions. 
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TABLE X 





Register 
Name 


Field 
Name 


Bits 


Access 


Description 




TE_I_Cntl 




32 


§ RWSC 


a mmeoiat e I low- ControJL 
instruction register 


5 




STRT 


1 


§ RWSC 


otarc uriaiigxe engine 
execution at the 
deferred instruction 
address specified in 
the below 
TE_I_Cn t l_Data 
register 






STPI 


1 


§ RWSC 


Stop execution after 
current instruction 






STPL 


1 




catop execution at end 
of current instruction 
list 






RSTRT 


1 


§ RWSC 


Restart execution from 
stop point 






STEP 


X 


§ RWSC 


Single step - execute 
next instruction and 
then flush the TB 
pipeline 


10 




INT 


1 


§ RWSC 


Interrupt CPU after 
completion of current 
instruction | 








26 


N/A 


reserved | 



TABLE XI 



Register Name 


Bits 


Access 


Description 


TE_I_Cntl_Data 


32 


§RW 


Contains data referenced 
by instructions generated 
by the TE_I_Cntl flow 
control register 



TABLE XII 



25 



Register 
Name 


Field 
Name 


Bits 


Access 
§ RWSC 


Description 


TEJD_Cntl 




32 


Deferred instructions 
flow control register 




JR 


1 


§ RWSC 


Jump relative - use 
TBDCntlData as a 2's 
complement offset 
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5 





JA 


i 


5 RWSC 


Jump absolute - use the 
TEDCntrlData as 
destination address 




PSE 


l 


§ RWSC 






nrr 


i 


§ RWSC 


Internet the CPU and use 
contents of TKDCntlData 
as the vector data for 
the interrupt 




SYNC 


i 


§ RWSC 


Pause instruction 
execution until TE 
pipeline has been flushed 




TLD 


i 


5 RWSC 


Load nev texture data 
into texture cache 






26 


N/A 


reserved 



TABLE XIII 



Register Name 


Bits 


Access 


Description 


TKDCntlData 


32 


§RW 


Data referenced by TEDCntl 
register 



TABLE XIV 



25 



Register Name 


Bits 


Access 


Description 


IRP 


32 


§R 


Read pointer register which 
points to instruction 
currently being executed by j 
triangle engine | 




TABLE XV 


Register Name 


Bits 


Access 




IWP 


32 


§RW 


Instruction write pointer 
points to the first empty 
memory location beyond the end 
of list 
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XABLK XVI 



5 



10 



15 



Register 
Name 


Field Name 


Bits 


Access 


Description 


IntEn 




32 


§ RWSC 


Interrupt enable 
word 




ImlntEn 


1 


§ RWSC 


Immediate Interrupt 
Enable 




FBClipIntEn 


1 


S RWSC 


Pixels were 
discarded due to 
frame buffer 
clipping 




ALUS tat In tBn 


1 


§ RWSC 


Check for >, =, and 
< 




ZFuncStatlntE 
n 


1 


§ RWSC 


Result of Z 
buffering 




AnyRendlntEn 


1 


§ RWSC 


At least 1 pixel was 




SuperlntBn 


1 


§ RWSC 


Supervisor error 




Ui Ins tr IntEn 


1 


§ RWSC 


Un- implemented 
instruction 
interrupt enable 




Spins trlntKn 


1 


§ RWSC 


Special instruction 
interrupt enable 






14 




reserved | 



TABLE XVII 



20 



25 



Register 
Name 


Field Name 


Bits 


Access 


Description 


Intstat 






§ RWSC 


Triangle engine 
interrupt status 
word 




Imlnt 


1 


§ RWSC 






FBCliplnt 


1 


S R 


Pixels were 
discarded due to 
frame buffer 
clipping 




ALOStatlnt 


1 


§ R 


Check for >, =, and 

< 




ZFuncStatlnt 


1 


§ R 


Result of Z 
buffering 




AnyRendlnt 


1 


S R 


At least 1 pixel 

was rendered 1 
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5 





Superlnt 


1 


§ RWC 


Supervisor error 




Uilnstrlnt 


1 


§ RWC 


Un- implemented 
instruction 
interrupt status 




Splnstrlnt 


1 


§ RWC 


Special instruction 
interrupt status 




WinClipXnt 






Pixels were 
discarded due to 
window clipping 




LifltSndTTit" 


X 


3 RWC 


IRP == IWP (End of 
worJclist has been 
hit } 




Imlnstrlnt 


1 


§ RWC 


Immediate 
instruction 
interrupt status 




Dflnstxlnt 


1 


5 RWC 


Deferred 
instruction 
interrupt status 




DflnstrlntVec 
t 


8 


§ RWC 


Deferred 
instruction 
interrupt vector 






12 




reserved 



TABLE XVIII 



15 



Register Name 


Field 
Name 


Bits 


Access 


Description 


Vertex Control 




32 


§ RWSC 






VtxSnoopE 
n 


1 


§ RWSC 


Vertex snoop 
enable 






31 


N/A 


reserved 



The following table . describes the registers 
within the. setup control register set 162 of Pig. ib. 
Each of the setup registers contains a full floating 
point value having one sign bit, seven exponent bits 
and 24 mantissa bits. 
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TABLE XIX 





Rofli fit* AT* 


ACCQSB 


Description 




ex X 


l£ DU 


x coordinate of Vertex A 




a_y 


§ RW 


y - 


VxA 




5 


a_r 


§ RW 


R - 


VxA 






a-g 


§ RW 


G - 


VxA 






ajb 


S RW 


B - 


VxA 






a_a 


§ RW 


A - 


VxA 






a_W 


§ RW 


1/w 


- VxA 




10 


a_uW 


§ RW 


u/w 


- VxA 






a_vW 


§ RW 


v/w 


- VxA 






ivl-7 


5 RW 


seven registers for intermediate 
variables 1-7 




b_x 


§ RW 


x - 


VxB 








§ RW 


y - 


VxB 




15 


b_r 


§ RW 


R - 


VxB 






b_g 


§ RW 


G - 


VxB 






bjb 


§ RW 


B - 


VxB 






bji 


§ RW 


A - 


VxB 






b_w 


§ RW 


1/w 


- VxB 




20 


b_uW 


§ RW 


u/w 


- VxB 






b_vW 


§ RW 


v/w 


- VxB 






iv8-14 


§ RW 


intermediate variables 8-14 




c _ x 


§ RW 


x - 


VxC 






c — y 


S DU 

a xivf 


y - 


VxC 




25 


c_r 


§ RW 


R - 


VxC 






c_g 


§ RW 


G - 


VxC 






cjb 


§ RW 


B - 


VxC 






c_a 


§ RW 


A - 


VxC 






c_w 


§ RW 


1/w 


- VxC 




30 


C_uW 


§ RW 


u/w 


- VxC 






C_VW 


§ RW 


v/w 


- VxC 






ivlS-21 


§ RW 


intermediate variables 15-21 
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TABLE XX 



5 



Register Name 


Field 
Name 


Bit 
s 


Acces 

8 


Description 

- 


Vertex State 




32 


§ RW 


Triangle engine 
interrupt status 
word 




VCNT 


2 


§ RW 


Number of valid 
vertices 




TSORT 


3 


§ RW 


Order of incoming 
vertices, which is 
old, which is middle 
and which is new for 
vertices A, B, C 






27 







The walkers control register set 163 includes the 
following specified registers. 



TABLE XXI 



20 



Register 
Name 


Field 
Name 


Bits 


Access 


Description | 


ES_Cntl 




32 


§ RWSC 


Edge and span walkers 
control 




PRSPOff 


1 


§ RWSC 


Perspective 
calculations off 




DUScan 


1 


§ RWSC 


Down/Up Scan direction 
flag 




DSPOff 


1 


§ RWSC 


Disables double strike 
prevention 




reserved 


29 


N/A 





TABLE YYT a 



25 



Register 
Name 


Field 
Name 


Bits 


Access 


Description 


ESCapAddr 




32 


§ RW 


Address for edge -walker 
buffer capture 




ESCapAddr 


6 


§ RW 


Provides access to the 
following internal 
registers of the edge- 
walker 




reserved 


26 


N/A 
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TABLE XXII 





Edge Walker Buffer 


Access 


Description 1 






a R 






VI 


§ R 






fti 


§ R 






(il 


5 R 






Bl 


§ R 






Al 


§ R 






Wl 


§ R 






uWl 


§ R 






vWl 


§ R 






X2 


5 R 






Y2 


§ R 






Y3 


§ R 




20 


xstep 0 


§ R 






xstep l 


§ R 






xstep_long 


§ R 






xystep 0 


S R 






xystep 1 


5 R 




25 


xystep long 


§ R 






dy 0 


§ R 






dy i 


§ R 






dy long 


S R 






dDx r 


§ R 




30 


dDx g 


§ R 






dDx b 


§ R 






dDx a 


S R 






dDx W 


§ R 






dDx uW 


§ R 




35 


dDx vW 


S R 




1 slope r 


S R 
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slope g 


S R 




slope b 


§ R 




slope a 


§ R 




slope W 


§ R 




slope uW 


§ R 




slope vW 


§ R 




R2L 


§ R 





10 The blender unit 156 is subdivided into two 

successive modules: a texture blender, followed by a 
destination blender. Texture blending creates a 
combination of the iterated values {R,G,B,A}i and the 
corresponding texture components {r,g,b,a)t. In the 

15 destination blender portion of blender module 156, this 
blended value is then blended with the frame buffer 
value from a source frame buffer, or has other 
operations performed on it. The following 32 -bit wide 
registers in system memory address space are for use 

20 with the texture -blending portion of module 156. 



TABLE XXIII 



25 



30 



Register Name 


Access 


Description 


PIP 


§ RW 


PIP RAM 


TXTCntl 


§ RWSC 


Texture Mapper master control 
register 


TxtlidCntl 


§ RWSC 




TxtAddrCntl 


§ RWSC 


Address Generation Control 
Register/Filter modes 


TxtPIPCntl 


§ RWSC 


PIP control register 


TxtTABCntl 


§ RWSC 


Texture Application Control 
Register 


TxtlODBaseO 


§ RW 


LOD 0 Base Address 


TxtLdDstBase 


§ RW 


Texture load destination base 


TxtMMDestBase 


§ RW 


MMDMA Destination Base 


TxtlODBasel 


§ RW 


LOD 1 Base Address 


TxtLODBase 2 


§ RW 


WD 2 Base Arirt™ PS 
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TxtL0DBase3 


§ RW 


IiOD 3 Base Address 




TxtLdSrcAddr 


§ RW 


Texture decompressor source 
register 




TxtMMSrcBase 


§ RW 


MMDMA source base 




TxtByteCnt 


§ RW 


Byte Count for MMDMA 


5 




K DU 

a k.w 


Row count for uncompressed texture 
load 




TxtTexCat 


§ RW 


Texel Count for compressed texture 
load 




OVmax 


§ RW 


Texture Size Register 




UVmask 


§ RW 


Texture Mask Register 




TxtSRCTypeOl 


§ RWSC 


Source Description Registers - 
Type 0 and 1 


10 


TxtSRCType23 


§ RWSC 


Source Description Registers - 
Type 2 and 3 




TxtExpType 


§ RWSC 






TxtSRCCoastO 


§ RW 


Source Expansion Constant Register 
- Type 0 




TxtCLRConst 


§ RW 


MMDMA Clear Constant Register 




TxtCATConstO 


§ RW 


PIP Constant color - SSB = 0 


15 


TxtSRCConstl 


§ RW 


Source Expansion Constant Register 
- Type 1 




TxtPIPConstl 


§ RW 


PIP Constant color - SSB - i 




TxtSRCConst2 


§ RW 


Source Expansion Constant Register 
- Type 2 




TxtCATICanstO 


§ RW 


Texture Application Constant color 
- SSB = 0 




TxtSRCConet3 


§ RW 


Source Expansion Constant Register 
- Type 3 


20 


TxtCATICanstl 


§ RW 


Texture Application Constant color 
- SSB = l 




TxtSRCKxp 


§ RWSC 


Source Expansion Control Registers 
- Types 0, i, 2 and 3 



The following are registers used by the 
25 destination blender: 



WO 96/36011 



PCT/US96/06653 



-80- 



TABLE XXIV 





| Register Name 


AUCcBB 


Description 




Snoop 


§ RWSC 


Snoop control for Pest Blender 




IOCntl 


§ RWSC 


Destination Blend I Control 


5 


H DitherMatA 


§ RWSC 


First half of 4x4x4 dither matrix 


K DitherMatB 


§ RWSC 


Second half of 4x4x4 dither matrix 




FBClipSn 


5 RWSC 


Bnable clipping to frame buffer - 
this should be on by default 




FBClip 


§ RWSC 


X and Y frame buffer clip values 




WinClipBn 


§ RWSC 


Bnable X/Y Clipping (inside or 
outside clip region) 


10 


XWinClip 


§ RW 






YWinClip 


§ RW 


_Y window clip values 




WrCntl 


§ RWSC 


Write Control register 




reserved 


K/A 






Wr32Bpp 


§ RWSC 


: 

Wr pixel format (16 or 32 bpp) 8 


15 


WrOutBn 


§ RWSC 


Master control for component 

OUtpUt (turned Off for 2 rpnriori 

or test rendering) 




WrBaseAddr 


§ RW 






nzjLotrxcie 


§ RW 


x Stride value - Distance in words 
to next line 




SrCntl 


§ RWSC 


Read Control register 




SrcBaseAddr 


§ RW 


Read base pointer 


20 


SrcXStride 


§ RW 


Stride value - Distance in words 
to next line 




SrcOffset 


S RW 


X and Y Offset used for reads 




ZCntl 


§ RWSC 


Z Buffer Control register 




ZBaseAddr 


§ RW 


Z Base address - NO double 




ZXYOffset 


§ RW 


X and Y Offset used for 2, Offset 
calc is performed before clipping 


25 


ZClip 


§ RW 


Max clip values used for Z window 




SSBDSBCntl 


S RWSC 


SSB & DSB (D-bit) Control register 




RGBCntl 


§ RWSC 


per component enable for output 
(otherwise recycle old value) 




Constln 


§ RW 


Const used for computation input 
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10 



TxtMultCntl 


§ RWSC 


Texture color multiplication 
control 


DmTxtConstO 


§ RW 


Coef const s "0° 


DmTxtConstl 


§ RW 


Coef consts "l" 


SRCMultCntl 


§ RWSC 


Texture color multiplication 
control 


DmSrcConstO 


§ RW 


Coef consts "0" 


DmSrcConstl 


§ RW 


Coef consts "l" 


ALUCntl 


§ RWSC 


ALU Control register 


SrcAlphaCntl . 


§ RWSC 


Control of source alpha 


DestAlphaCntl 


§ RWSC 


Control of destination alpha 


DestAlphaConst 


§ RW 


Destination Alpha constants 


Status 


§ R 


Status indication word 


IntCntl 


S RWSC 


Mask indicating which ALU and Z 
results should generate interrupts 



IS The various privileged (§) and non- privileged 

registers may be read from or written to by 
correspondingly privileged and non -privileged control 
means for carrying out their respective functions. A 
Privileged control means can of course, always access 

20 a non-privileged TE register. 

Fig. 15 is a block diagram illustrating the major 
functional blocks of the set-up engine 152 (Fig. IB) . 
As previously described, the set-up engine takes the 
data at the three vertices of a triangle to be 

25 rendered, and calculates the various slopes and 
gradients used by the remainder of the pipeline to 
reconstruct the triangle. The center of the engine is 
a multi-port register file 1502 having four read ports 
and two write ports. Two of the read ports serve a 

30 floating point adder 1504, and the remaining two read 
ports are shared between a floating point multiplier 
1506 and a floating point divider 1508. One write port 
serves the adder and also input data from the vertex 
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10 



unit 151. The second write port serves the multiplier 
1506 and divider 1508. Output data to the edge walker 
control registers 163 is through the output from the 
adder 1504, because the last operation in all cases is 
a float ->fix calculation. 

The set-up engine 152 is controlled by a 
sequencer 1510, the microcode for which is stored in a 
ROM/PLA 1512. 

The register file 1502 contains three sets of 
vertex registers 1514, as well as a number of general 
registers 1516 for storing intermediate variables. 
Numbers received from the vertex unit 151 as "next 
vertex" data are in IEEE floating point format. 
However, they are converted into an internal floating 
15 point representation before being stored in the 
register file 1502. When a new vertex is received, the 
vertex engine 151 indicates a replace mode via a 
vtx_stpRM signal. This signal indicates which of the 
three preexisting vertices in the register file 1502 is 
to be replaced, and depends on whether the current 
sequence of triangles is a strip or a fan sequence. 

One of the operations performed by the floating 
point arithmetic units in the set-up module 152 in 
conjunction with the sequencer 1510 is conversion of 
the i' "S 311(1 $ ^ues from the internal floating point 
format to a block- fix format before they are output to 
the edge. walker control registers 163. The block- fix 
format at this stage contains 24 bits of significand 
and a 2 -bit "w- range" value. A w- range value of 0 
30 indicates that the significand ranges between [1/8, 1) . 
A w- range value of l indicates that the significand 
ranges between [1/64, 1/8). A w- range value of 2 
indicates that significand ranges between [1/512, 
1/64) . A w- range value of 3 indicates that significand 



20 



25 
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ranges between (o, 1/512). m essence, each w- range 
increment indicates a left -shift of three bits, so that 
a number encoded in the block- fix format can be 
calculated as s x 2' 3e , where s is the significand and 
e is the w- range value. 

The floating point arithmetic units illustrated 
in Pig. 15 are unimportant for an understanding of the 
present invention, except to note that they, in 
conjunction with the sequencer 1510, perform the block- 
fixing operation for the 1 data output to the edge 
walker. This is done by examining the 1 values stored 
in the vertex registers 1514 for all*" three current 
vertices, to determine the highest -numbered w- range 
into which they can be shifted. This w- range is given 
15 by the integer part of one-third of the smallest number 
of leading zeros in any of the three 1 values (but no 
more than nine bits of left shifting*) . Significant 
hardware savings downstream of the set-up engine 152 is 
achieved by shifting all three 1 values into the same 
20 w- range, rather than maintaining each in its own 
optimal w- range. Therefore, only one w- range value is 
passed to the downstream hardware from the set-up 
engine 152. Additionally, once the w- range is 
determined based on the 1 values for the existing three 
25 vertices, the apparatus also shifts the -H and Z values 
for each of the three vertices in the vertex registers 
1514, into the same w- range before passing these values 
downstream. The advantage of using the same w- range 
for the -H and Z values will become apparent below. 
Once the 1, 1 and I values have been converted to this 
block- fix format, it will be appreciated that 
downstream calculations, such as interpolation, will be 
able maintain up to nine bits of additional precision 
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while still using what is essentially fixed-point 
arithmetic. 

For example, as previously mentioned, the duties 
of the set-up engine 152 include calculating slope and 
d/dx values for the 1, | and 1 values. Because the 
-, -H and 1 values for all three vertices are already 
represented in the same w- range in the block- fix 
format, these calculations can be performed using only 
significance. The w-range value is. not referenced, and 
the resulting slope and d/dx values are inherently 
block- fixed in the same w-range. 

The floating point units in Fig. is perform the 
block- fix function by applying a relevant constant from 
a constants array 1518, via a multiplexer 1520, to the 
B input of the floating point adder 1504. The basic 
operation of a fix is to shift the mantissa of the 
floating point number from the vertex register 1514 by 
the number of places denoted by the difference in the 
exponents of the number to be fixed and the fix 
20 constant, which is always an Order of Magnitude 0 
(CMZ). m the normal course of events, this entails a 
right shift since the number to be fixed is within a 
range which is less than the fix constant. However, 
the number can exceed its allotted range, in which case 
25 a left shift is performed by the number of places 
denoted by the difference in the exponents. Initially, 
the exponents from the two inputs A and B of the 
floating point adder 1504 are compared. If the exponent 
of A is smaller than the exponent of B, then A is right 
30 shifted by the difference in the exponents. The least 
significant bits are discarded, if the exponent of B 
is smaller than the exponent of A, then the number has 
exceeded its allotted range, in which case A is shifted 
to the left by the difference in exponents and the most 
35 xnsignificant bits are discarded. The output is then 
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provided to the edge walker as a 32 -bit word having a 
sign bit, an exponent portion cleared to zero, and a 
mantissa portion. The mantissa portion has the fixed 
point value left justified therein, the remaining bits 
5 being zero. 

Note that the block- fix format for values A, « 
and 1 is used only when triangles are specified using 
the short form of vertex instructions. For triangles 
specified using long form vertex instructions, the set- 
10 up engine is bypassed and A, | and 1 values are 
transmitted to the edge walker always in w- range 0. 

Fig. 16 is a block diagram of the major 
functional units of the edge walker 153 (Fig. IB) . 
Data from the vertex engine 151 or the set-up engine 
15 152 is provided initially to a double buffer 1602, 
which then provides the data to five parallel data 
paths, all under the control of an EWK controller 1604. 
The five data paths are a characteristic function (C) 
data path 1606, a Y-data path 1608, an X-left data path 
1610, a color data path 1612, and a UVW data path 1614. 

The function of the double buffer 1602 is to 
create a pipeline between the set-up engine 152 and the 
edge walker 153 . The edge walker may be processing one 
triangle while the set-up engine is calculating the 
various parameters for the next triangle and placing 
them in the double buffer 1602. In addition to the 
numeric data and other parameters, the double buffer 
1602 stores the w- range value for the various block- 
fixed values represented for the triangle. 
30 rlie characteristic function data path 1606 

determines how the edge walker 153 moves from one pixel 
to the next along the long edge of the triangle. The 
algorithm is similar to a Bresenham line drawing 
algorithm. A similar calculation is used on the two 
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short edges of the triangle to determine the ending 
pixel in a span. 

Y data path 1608 merely adds 1 to the current y 
value every time the edge walker moves down one pixel 
to the next span. It checks for two conditions - 
whether the next y value will be equal to Y2, the 
switch point of the short sides (i.e., where side 0 
ends and side 1 begins) or equal to Y3, the bottom of 
the triangle. 

X-left data path 1610 calculates the left x value 
of each span, in other words, the points on the left- 
hand edges of the triangle. At the start of a new 
triangle, the X-left data path 1610 downloads from the 
double buffer 1602 the new x, and ^ values, as well as 
15 an x^ value. The value x, is latched, and incremented 
by x^ for each iteration. The x^ value is one of 
three values calculated by the set-up engine 152. if 
the current triangle is a left-to-right triangle (long 
side on the left), then the x^ value is the X 
20 If the long side is on the right, then the x 3tep value is 
^.p.o at the beginning of the triangle, and then at some 
point switches to x stepJ after the second vertex is 
reached. 

The color data path 1612 uses the color values, 
25 color slopes and color d/dx values to compute the 
starting colors (red, green, blue and blending factor 
A) along the long side of the triangle. The iterative 
color value will be the current value plus the slope, 
or the current value plus the slope plus or minus the 
30 d/dx value. The characteristic function determines 
which equation is used, and the direction of scan 
determines whether the d/dx value is to be added or 
subtracted. 
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Fig. 17 illustrates this more clearly. In Pig. 
17, the line 1702 represents a portion of the long edge 
of a triangle, passing through a grid of pixels having 
columns Or 5 and rows 0-5. On row 0, the leftmost pixel 
in the span is pixel 1704. This span endpoint is 
assigned an appropriate color value for the color then 
being processed, for example a red color value. The 
left endpoint of the span on row 1 is pixel 1706, and 
the color value is given by the color value of pixel 
1704 plus the red slope (which yields the color value 
at pixel 1708, outside the triangle), plus the d/dx 
value for red. The pixel 1710 is the leftmost pixel in 
the span on row 2, and the color value for red is 
calculated by adding only the red slope to the color 
15 value for pixel 1706, and so on. The d/dx value for 
red is not added for pixel 1706 because this pixel is 
already inside the triangle. 

Fig. 18 illustrates the major functional blocks 
of the color data path 1612. At the start of the 
triangle, all new color, slope and d/dx values are 
latched in from the double buffer 1602. The operation 
of this data path is a two- staged pipeline sequence of 
events. In the first stage, the slope and d/dx of one 
color, say red, are pre-added (or subtracted, as 
25 appropriate) and the result is latched in latch 1810. 
In the second cycle, a multiplexer 1812 controlled by 
the step_back signal from the characteristic function 
block 1606 determines whether the sum computed in the 
previous cycle will be added to the red value, or if 
just the slope value will be added to the red value. 
In the meanwhile, the sum of the slope and d/dx of the 
next color, green, are being computed in stage 1. 

In four cycles, this data path will rotate 
through all the colors, passing each one to the span 
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walker 154 cycle after cycle. This data path sets the 
edge rate of the edge walker 153 at no more than one 
every four cycles. 

Returning to Pig. 16, the UVW data path 1614 is 
5 responsible for the iterated A, 1 and 1 values along 
the long edge of the triangle and passing these to the 
span walker 154. The block diagram for this data path 
is the same as that shown in Pig. is for the color data 
path 1612, except that there are only three inputs 
10 instead of four. The outputs from this block are 
carried to the span walker 154 on ewk_swkUVW and 
ewk_swkUVWD/dx buses. since there are only three 
inputs to this data path block, only three cycles are 
required to compute the values necessary to transfer to 
15 the span walker. Since the color data path 1612 fixes 
the minimum edge rate at one per four cycles, the UVW 
data path 1614 remains idle during the fourth cycle 
while the last color value is being computed. 

Fig. 19 is a block diagram illustrating the major 
20 functional blocks of the span walker 154. The span 
walker takes the left and right endpoints for a given 
span, as well as the starting color values and depth- 
adjusted texture coordinates, from the edge walker 153, 
and linearly interpolates across the span to compute 
25 the corresponding values for each pixel. 

At the start of each span, the span walker 154 
receives seven starting parameters from the edge walker 
153: red, blue, green and blend factor, i, « and 1, 
all for the first pixel of the span, m addition, the 
d/dx values (the change or gradient in each of these 
seven parameters with one pixel step in the x 
direction) are also passed from the edge walker 153 to 
the span walker 154. The span walker 154 always begins 
its traversal of a span from the long side of the 
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triangle, so the traversal could be either from left to 
right or from right to left, as indicated to the span 
walker by a signal from the edge walker 153. 

The throughput of span walker 154 is two pixels 
5 per cycle. Each cycle, the span walker is responsible 
for passing color and texture values for a pair of 
pixels to the texture mapper 155, and also for passing 
x, y and 1 values for a pair of pixels to the 
destination blend unit 156. While color information is 

10 interpolated exactly (to the extent of the hardware 
data path widths) for each pixel, in order to achieve 
two pixel per cycle performance, the span walker 154 
exactly interpolates . the texture coordinates (u, v) for 
only every alternate pixel. The texture coordinates 

15 for intermediate pixels are calculated by averaging 
from the surrounding two pixels. The only exception is 
the endpoint pixel values, which are always computed 
exactly. The texture coordinates of these intermediate 
pixels are still considered herein to be 

20 "interpolated", even though their values might 
represent only an approximation of the exact 
interpolated value. 

Table XXV lists the format of each datum passed 
into or out of the span walker 154 . All data is 

25 handled as fixed-point numbers, and the format column 
indicates a sign bit (s, if present) , the number of 
integer bits, a decimal point, and the number of 
fractional bits. 
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5 



10 



15 



Information 


Format 


Source 


Destination 


Details 


Coordinates 


11.0 


Edge Walker 


Span Walker 


(Xleft,Xright,Y) 


Colors 


8,11 


Edge Walker 


Span Walker 


(red, green, blue) 


Blend Factor 


8.11 


Edge Walker 


Span Walker 


(A) 


Textures 


10.13 


Edge Walker 


Span Walker 


(u/w,v/w) 


Distance 


0.23 


Edge Walker 


Span Walker 


(1/w) 


Color d/dx 


s8.ll 


Edge Walker 


Span Walker 


d(r,g,b)/dx 


Di\*i\i rwsvor u/ua 


.O 11 

8o.il 


Edge Walker 


Span Walker 


d(A)/dx 


Texture d/dx 


310.13 


Edge Walker 


Span Walker 


d(u/w,v/w)/dx 


Depth d/dx 


s0.23 


Edge Walker 


Span Walker 


d(l/w)dx 


Coordinates 


11.0 


Span Walker 


Destination Blender 


(x0,y0) 


Distance 


0.16 


Span Walker 


Destination Blender 


(w0,wl) 


Colors 


8.0 


Span Walker 


Texture Mapper 


(red, green, blue) 


Blend Factor 


8.0 


Span Walker 


Texture Mapper 


(A) 


Textures 


10.4 


Span Walker 


Texture Mapper 


(u,v) 



There are a total of 17 parameters passed from 
20 the edge walker to the span walker. The first three 
are coordinate values, Xleft, Xright and Y. The X 
values define the endpoints of the span, and the Y 
value determines the Y location. All three of these 
values are represented as unsigned 11-bit integers, 
25 corresponding to the maximum 2K x 2K frame buffer size 
allowed by the triangle engine. The Y value is not 
used by the span walker, but is passed directly through 
to the destination blender, which requires both X and 
Y coordinates in computing the memory address to write 
30 the pixel data. The format for the X and Y coordinates 
passed from the span walker to the destination blender 
is also li.o. 
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The next seven items in the table are the 
starting parameters for the first pixel of the span. 
As discussed previously, the Span Walker will begin 
with either the leftmost or rightmost pixel of the 
5 span, determined by an ewk_swkLef t2Right signal from 
the edge walker. It will generate an R, G, B, A, u/w, 
v/w and l/w vaiue for each pixel in the span, and pass 
this data to the destination blender or texture mapper. 
The format of the color data (R, G, B, A) is 

10 8.11. The eleven bits of fraction are carried through 
all calculations in the span walker data path, but 
truncated when the data is passed to the Texture 
Mapper, resulting in an 8.0 color format from the span 
walker to the texture mapper. 

15 The l/w value, which is really only the 

significand in the block- fix format, is guaranteed to 
be within the range 0s(i/w)<i and is represented as a 
0.23 format number. This value is used internally to 
compute u and v from u/w and v/w, and also the iterated 

20 l/w significand is passed on to the destination blender 
for use in z-buffering. At this point the precision is 
reduced to 0.16. 

The u/w and v/w parameters also contain only 
significands, and have a format of 10.13 within the 

25 span walker data path. These values are used 
internally to compute u and v, which have a format of 
10.4 upon, transfer to the texture mapper. The ten bits 
of integer are necessary to access a lKxlK texture, 
which is the maximum size the triangle engine allows, 

30 and the four bits of fraction are used by the texture 
mapper for appropriate weighting of texels. As 
explained below, the u and v values provided to the 
texture mapper are inherently and automatically no 
longer in block-fix format. 
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The remaining parameters in the table are the 
d/dx values for the colors, u/w, v/w and 1/w. The 
format of each of these is the same as for the 
associated parameter, except that the d/dx value also 
5 includes a sign bit. This is because the value of each 
parameter could be increasing or decreasing along the 
span. The d/dx values for u/w, v/w and 1/w contain only 
signif icands. 

Fig. 19 is a block diagram illustrating the major 

10 functional blocks of the span walker 154. Since the 
span walker 154 processes two pixels per clock cycle, 
there is no sharing of arithmetic logic units as in the 
edge walker 153. Therefore a separate set of adders is 
used for each of the seven pixel parameters (R, G, B, 

15 A, u/w, y/w and 1/w) computed each cycle. 

Referring to Fig. 19, the span walker 154 
includes an x-y data path 1902, which receives the x- 
left, x- right and y values at the beginning of each 
span. Of these, the y value is piped straight through 

20 to the destination blender 156. If the scan direction 
is left -to- right, then the x-y data path 1902 
increments the x-left value by two each cycle until the 
end of the span has been reached. Similarly, if the 
scan direction is right- to- left, x-left is held 

25 constant and x- right is decremented by two each cycle 
until the end of the span has been reached. 

The. span walker 154 also includes four identical 
color data path modules 1904-R, 1904-G, 1904-B and 
1904-A (collectively, 1904). At the beginning of the 

30 span, the color value (format 8.11) and the color d/dx 
value (format s8.ll) from the edge walker are loaded 
into the registers in the appropriate color data path 
1904. On each cycle, the color data path 1904 computes 
color plus d (color) /dx and also color plus 2d (color) /dx 



WO 96/3*011 



PCTAJS96/06653 



-93- 

using separate adders. This allows the span walker 154 
to generate two pixel color values per cycle. The 
output of the color plus 2d ( color )/dx adder is latched 
back into the color value register to prepare for the 
5 next cycle. 

Before the color values are dispatched for 
blending, they are truncated to 8 bits (format 8.0). 

The span walker 154 also includes a l/w data path 
1906. Most of the l/w data path 1906 is the same as 

10 the color data path 1904. At the beginning of the 
span, the initial l/w (format 0.23) and the d/dx value 
(format s0.23) are loaded into the data path 1906 from 
the edge walker 153. As with the color data path 1904, 
the l/w data path 1906 generates two l/w values per 

15 cycle, namely: 

20 where the subscript "prev. » indicates the value for the 
previous x-position, the subscript "0 n indicates the 
value for the first x position newly calculated, and 
the subscript "l" indicates the value for the second x 
position newly calculated. Again, these calculations 

25 are performed as fixed-point calculations using only 
the significands of the block- fixed format data, and 
the w- range value does not need to be referenced at 
all. However, for transmitting to the destination 
blender 156, the calculated (l/w) significands are 

30 truncated to 14 bits (format 14.0) and concatenated 
with the 2 -bit w- range value. In addition, because the 
l/w value is also used in the confutation of the depth- 
independent texture coordinates (u, v) , there is also 
an output from the l/w data path 1906 to an inversion 
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module 1908. This output carries the (l/w) 0 significand 
in the full 0.23 format. 

The inversion module 1908 performs Newton - 
Rapheson inversion of the (l/w) 0 value to iteratively 
5 approximate l/(l/w), which is w (again, significands 
only) . once the w value is obtained, it is used in 
another part of the data path to multiply the iterated 
(u/w) and (v/w) significands, converting the texture 
coordinates to depth independent form, and inherently 
10 cancelling out the block- fix exponent, for the texture 
mapper 155. 

Part of the Newton- Rapheson inversion algorithm 
involves normalizing the input (l/w) significand so 
that it is between 1 and 2. This produces an inverse 

15 that is in the range between 0.5 and 1. The 
normalization is achieved with a left -shift of the 
input (l/w) 0 value until the first time a 1 is shifted 
out of the highest order fractional bit. The number 
resulting after inversion would need to be shifted in 

20 the same direction by the same number of bits in order 
to denormalize. However, the denormalization step is 
not performed at this time; instead, the number of bits 
of left -shifting required is passed to a uv data path 
1910 described below. 

25 The span walker module 154 also includes a u/w, 

v/w data path 1912, containing two identical blocks 
19 12 -u/w. and 19 12 -v/w. The data path 19 12 -u/w is 
described herein, and the data path 1912 -v/w is 
identical . 

30 The data path 1912-u/w computes the iterative u/w 

significands along the span beginning by latching a 
starting value (format 10.13 significand) and d/dx 
(format slO.13 significand) from the edge walker 153. 
The data path 1912-u/v uses an adder and a clamp block 
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to iteratively add 2d(u/w)/dx each cycle to the 
starting value. This is similar to the color and l/w 
data path blocks previously described. 

The data path 1912 -u/w does not compute a pair of 
values for each cycle as do the color and l/w data 
paths. They compute a value for the first new pixel 
along the span but not the second. This is because 
there is only one inversion module 1908 in the data 
path, so only one w value can be computed per cycle. 
Instead, the intervening u and v values are averaged 
from the values calculated from the previous and new 
pixels instead of being calculated directly. 

The internal representation of u/w significance 
is a number of format 10.13. For use in the final 
15 computation of u and v, though, only a 16 -bit value is 
required. Therefore, the least significant seven bits 
are truncated off u/w and v/w in creating the 10.6 
format outputs of the data path 1912. Before this 
occurs, however, the values are normalized by detecting 
20 leading zeroes and left -shifting until there is a '1' 
in the most significant bit. A maximum shift of seven 
bits is permitted, since a greater shift would begin 
introducing zero bits from the right. The number of 
bits shifted is passed on to the uv data path 1910. 
25 Note that this shift amount is separate from the shift 
amount represented by the w- range value with which u/w 
and v/w continue to be represented in the data path 



1912 



The uv data path 1910 is the final stage in the 
30 span walker 154 data path. Two identical blocks are 
used, one to compute (u/w) times w (data path 1910-u) , 
and the other to compute (v/w) times w (data path 19 io- 
v) . Only the data path 1910-u is discussed below. 
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Recall that only one w value and one u/w value is 
computed each cycle, although the span walker 154 must 
generate a pair of u values each cycle, since the span 
walker throughput is two pixels per cycle. As discussed 
5 previously, the span walker only computes every other 
u coordinate from the u/w and w values . The remaining 
u coordinates are computed as the average of the u 
values of the adjacent pixels along the span. The only 
exception to this rule is that the end pixel of a span 
10 is always computed exactly. This is to ensure that 
adjacent texture-mapped triangles will fit together 
seamlessly. 

Fig. 20 illustrates the major functional blocks 
of the uv data path 1910 (primarily data path 1910-u is 
15 shown) . The top of the data path consists of a 16x20- 
bit multiplier 2002, which receives a w value from the 
inversion module 1908 and a 16 -bit (u/w) value 
(significand only) from the data path 1912 -u/w. The 
resulting 36-bit product is a format 10.26 and is 
20 latched in latch 2004. Before the product can be used 
as a coordinate into the depth- independent texture map, 
however, it must be denormalized, because both the w 
and u/w values were subjected to normalization when 
they were computed. This denormalization is performed 
25 by block 2006. Signals are received here that indicate 
the degree of normalization to which w and u/w were 
subjected, and the normalization block 2006 determines 
from these values the number of bits to shift left or 
right to obtain a true u value in format 10.6. Note 
that in the denormalization process, the width of u is 
reduced from 36 bits to 16 bits. Importantly, note 
further that no shifting is required at this point to 
convert from block- fix representation of w or u/w to 
ordinary fixed-point representation. This is because 
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both u/w and 1/w as provided to the inversion module 
1908 were block- fixed into the same w- range. When 
(u/w) is divided by (1/w), which is the function 
performed by inversion module 1908 in conjunction with 
5 the multiplier 2002, the amount of the block- fixing 
shift cancels out. This is one reason why it is 
advantageous, in the set-up engine 152, to block- fix 
the u/w values into the same w- range as the 1/w values. 
Another reason is that u/w and v/w calculations also 

10 become more precise. 

The output of the normal izer 2006 in Fig. 20 is 
latched in a flip-flop 2008, then reduced to 10.4 
format and latched again in a flip-flop 2010. Because 
of the pipelined nature of this data path, when the 

15 output of the flip-flop 2008 holds the u value for 
pixel n of the span ("u H "), then the output of the 
normalizer 2006 will be the u value for pixel n plus 2 
of the span ("u^j"). Both of these quantities are 
provided to an adder 2020, the output of which is 

20 shifted right by 1 bit to become the average of the two 
numbers. The average is then reduced in width to a 
10.4 format and latched to become u, for the texture 
mapper 155. The output of flip-flop 2010 carries u„ for 
the texture mapper 155. 

25 One final capability of this part of the data 

path is to allow the exact computation of the u values 
for the last pixel in a span. If the span length is 
odd, this happens automatically as the last u value 
will naturally be computed exactly, and eventually 

30 appear on the u„ output. If the span length is even, 
however, the last u value will appear on the u, output, 
which is normally an averaged value. 

Note that the mux 2024 and adder 2020 at the 
bottom of the data path can be configured in such a way 
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aa to bring the output of the normalizer directly to 
the latch 2022. This is accomplished by sending the 
normalizer output to both inputs of the adder via mux 
2024. Usually, the output of the normalizer 2006 will 
5 be u H+2 when the output of the latch 2008 is u,, as 
discussed previously. if the normalizer output were 
instead u^,, then this exactly calculated u value could 
be routed to the latch 2022 achieving the desired 
result. This is in fact accomplished by adjusting the 

10 u/w, v/w and l/w data path blocks to increment by only 
1 d/dx instead of 2 on the last cycle of a span. 

Given the above, Figs. 21A and 21B (Fig. 21B is 
a continuation of Fig. 21A) illustrate an example 
process which can now be performed. Referring to Fig. 

15 21A, in a step 2102, an application program creates a 
model in model space and divides all polygons into 
triangles. Each vertex of each triangle has three 
spatial coordinates in model space, four color values 
<R,G,B,A) and two coordinates (u,v) into a depth- 

20 independent texture map. in a step 2104, the 
application program loads the texture buffer 143 and/or 
145 with the desired texture map. In a step 2106, the 
.. application program transforms the model as desired and 
projects it onto the image plane using 4x4 matrix 

25 computations and homogeneous coordinates. These 
computations naturally yield homogeneity divisors w for 
each vertex. 

In a step 2108, the application program divides 
all spatial vertex coordinates by w for the - vertex, 
30 yielding homogenized spatial coordinates (u/w, v/w) 
into an hypothetical depth -adjusted texture space. In 
a step 2110, the application program creates vertex 
instruction lists and passes them to the triangle 
engine 150. The values of each x/w, y/w, u/w, v/w and 
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1/w for each vertex are passed in full floating point 
representation. 

In a step 2112 , the vertex engine 151 and set-up 
engine 152 loop through the triangles to find by the 
5 vertices. The set-up engine sorts the vertices of the 
first triangle and computes walker gradients for the 
first triangle. The set-up engine compares the three 
l/w values to determine the optimum w- range that can be 
applied to all three l/w values, and block- fixes all 

10 three l/w values accordingly. The set-up engine 152 
also block- fixes all three u/w values and all three v/w 
values into the same w- range. The flow chart continues 
in Fig. 21B, as indicated by the circled symbol "B" 
appearing in both Figs. 21A and 2 IB. 

15 In a step 2114, the edge walker 153 walks along 

the edges of the current triangle in image space, and 
the span walker 154 walks across each span. In step 
2116, at the first pixel traversed by the edge walker/ 
span walker in image space, values for the following 

20 attributes are interpolated by bi- linear interpolation 
based on the corresponding attribute value at the 
triangle vertices: color (R,G,B,A) ; depth -adjusted 
texture space coordinates (u/w, v/w) ; and a homogeneity 
factor (l/w) . Only block- fix significands are used in 

25 the interpolation of depth- adjusted texture space 
coordinates and homogeneity factors, and only 
significands are interpolated for these values. 

In a step 2118, the span walker 154 next divides 
the depth- adjusted texture space coordinates (u/w, v/w) 

30 by the homogeneity value (l/w) calculated for the 
pixel, to compute depth- independent texture coordinates 
(u, v) for the pixel. Inherently, (u, v) are no longer 
in block- fix format. In a step 2120, the texture 
mapper 155 receives the (u, v) values and responds with 
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texture color values (R,G,B,A) T for the specified 
coordinates of depth -independent texture space. 

In a step 2122, the newly calculated l/w value 
for the current pixel is compared to the l/w value 
5 previously in the depth buffer for that pixel. If a 
newly calculated l/w value is nearer in model space to 
the image plane than the previous l/w value, then the 
process repeats at step 2114 with the edge walker/span 
walker proceeding with the next pixel. This comparison 

10 uses both the significands and the w-ranges 

If the comparison in step 2122 indicates that the 
newly calculated l/w value for the pixel represents a 
depth nearer than the l/w value previously in the depth 
buffer for that pixel, then in step 2124, the 

15 destination blender 156 blends the newly calculated 
color value for the pixel and the color value provided 
by the texture mapper 155 for the pixel, into the color 
value for the pixel as previously in the frame buffer, 
all in accordance with the blend factors A and possibly 

20 other control signals describing how to blend colors. 

If the blended color value is considered opaque 
(step 2126), then in step 2128, the homogeneity value 
i/w for this pixel is truncated and written to the 
corresponding depth cell with the 2 -bit w- range value 

25 for the current triangle. Each depth cell contains a 
total of 16 bits for the w- range number and the 
significand of the homogeneity value. The procedure 
then returns to step 2114 for the next pixel. If the 
blended color value is not considered opaque (step 

30 2126) , then the procedure returns to step 2114 without 
updating the homogeneity value in the corresponding 
depth cell. 

In step 2114, after all image space pixels of the 
current triangle have been rendered, the procedure 
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returns to step 2112 (as indicated by the circled 
symbol "A" appearing in both Figs. 21B and 21A) to set 
up the next triangle. If, in step 2112, all triangles 
have been rendered, the procedure is done (step 2139) . 
5 The above disclosure is to be taken as 

illustrative of the invention, not as limiting its 
scope or spirit. Numerous modifications and variations 
will become apparent to those skilled in the art after 
studying the above disclosure. For example, the 
ID invention is not restricted to RGB formats. Other 
digital formats such as YCC, or Composite Video 
Broadcast Standard (CVBS) , can also be used. For the 
sake of simplification, an RGB format was assumed 
above. 

15 Given the above disclosure of general concepts 

and specific embodiments, the scope of protection 
sought is to be defined by the claims appended hereto. 
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CLAIMS 
What is claimed is: 

1. A method for rendering a model from a three- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, for use with a depth 
buffer containing a plurality of depth cells 
corresponding to said image cells, comprising the steps 
of: 

providing an homogenized projection of a first 
surface region of said model onto an image plane in 
said model space, said image plane having a plurality 
of image regions each corresponding to a respective one 
of said image cells, said homogenized projection 
including an identification of a first one of said 
image regions for said first surface region and a first 
homogeneity value for said first surface region; 

modifying the values in the image cell 
corresponding to said first image region in dependence 
upon an attribute value of said first surface region; 
and 

storing a first depth value which depends on said 
first homogeneity value, in a first one of said depth 
cells which corresponds to said first image region. 
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2. A method according to claim 1, wherein each 
depth cell in said depth buffer corresponds to a 
respective one of the image cells in said frame buffer. 

3. A method according to claim 1, wherein said 
model includes a planar model space polygon having a 
plurality of vertices each having three spatial 
coordinates in said model space, said polygon including 
said first surface region, wherein said step of 
providing comprises the steps of: 

determining an homogenized projection of each of 
said vertices onto said image plane, said step of 
determining identifying a respective one of said image 
regions in said image plane for each of said vertices 
and generating a respective homogeneity value for each 
of said vertices, the image regions identified for said 
vertices collectively defining an image space polygon 
of said model space polygon, said image space polygon 
covering at least said first image region; and 

determining said first homogeneity value in 
dependence upon the homogeneity values generated for 
each of said vertices and upon the position of said 
first image region in said image space polygon. 

4. a method according to claim 3, wherein said 
image plane is defined by z=z p in said model space, said 
image plane further defining a two-dimensional image 
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space having x and y axes parallel to x and y axes of 
said model space, respectively, 

wherein said image space polygon is a triangle 
having first, second and third vertices, said first, 
second and third vertices having homogenized 
coordinates in said image space of (^,21), (i^ f i£) and 
w'TT^' respectively, and wherein said first image 
region has homogenized coordinates of (^i,^l) g where — 
is said first homogeneity value, 

and wherein said step of determining said first 
homogeneity value comprises the step of bi- linearly 
interpolating ± from * i and i in accordance with 
relative distances of said first image region from 
respectively said first, second and third vertices. 

5. A method according to claim 3, wherein said 
attribute value of said first surface region includes 
a designation of a first texture region in a predefined 
texture map. 

S. A method according to claim 5, wherein said 
texture map is defined procedurally. 

7. A method according to claim 5, wherein each 
of said vertices further has associated therewith a 
respective plurality of spatial coordinates in a depth - 
adjusted texture space, further comprising the step of 
determining said first texture region in dependence 
upon the depth -adjusted texture space coordinates for 
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each of said vertices and the position of said first 
image region in said image space polygon. 

8. A method according to claim 7, for use with 
a texture map defined in a two-dimensional depth- 
independent texture space with axes u and v, 

wherein the depth-adjusted texture space 
coordinates of said first, second and third vertices 
include (^,^), (-|^> and respectively, 
where (JL), (JL) and (±) are the homogeneity values 
for said first, second and third vertices, 
respectively, and wherein said first texture region has 
depth- adjusted coordinates {?ti,Ii) t 

and wherein said step of determining said first 
texture region comprises the steps of: 

bi- linearly interpolating (Jk) from (J^), (J^) 
and(J2) in accordance with relative distances of said 

3 

first image region from respectively said first, second 
and third vertices; 

bi - linearly interpolating ( I± ) from ( Jl ) , ( Zi ) 
and( ^ ) in accordance with relative distances of said 
first image region from respectively said first, second 
and third vertices; 

bi-linearly interpolating (JL) from {— ), (-L) 
and(-i) in accordance with relative distances of said 
first image region from respectively said first, second 
and third vertices; and 
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applying (_L) to (h.Zl) to calculate (u.,v.) , 
said first texture region being identified by (u,.,v.) . 

9. A method according to claim 1, wherein said 
step of storing a first depth value which depends on 
said first homogeneity value comprises the steps of: 

representing said first depth value with numbers 
(e,s), where e indicates an e'th one of a predefined 
plurality of sub-ranges in a predefined range of all 
possible depth values, said e'th sub- range including 
discrete depth values, and wherein s indicates an s'th 
one of the discrete depth values within said e'th 
predefined sub- range; and 

storing e and s in said first depth cell. 

10 . A method according to claim 9 , wherein said 
step of representing said first depth value with 
numbers (e,s) comprises the step of selecting e and s 
such that s x 2" be is equal to said first depth value, 
b being a predefined constant greater than 1. 

11. A method according to claim 10, wherein b=3. 

12. A method according to claim 1, wherein said 
step of storing a first depth value which depends on 
said first homogeneity value comprises the step of 
storing said first homogeneity value, further 
comprising the steps of: 

providing an homogenized projection of a second 
surface region of said model onto said image plane. 
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said homogenised projection of said second surface 
region including an identification of said first one of 
said image regions for said second surface region and 
a second homogeneity value for said second surface 
region; 

subsequent to said step of storing a first depth 
value, comparing said first homogeneity value in said 
first depth cell with said second homogeneity value; 
and 

storing said second homogeneity value in said 
first depth cell if and only if said step of comparing 
determines that a first predefined one of the following 
relationships is true: c 1 < c 2 , c 1 s c 2 , c 1 = c^ 
c, a c 2 , c-j > c 2 and c 1 £ Cj, where c 1 is said first 
homogeneity value and c 2 is said second homogeneity 
value, 

13. A method according to claim 12, further 
comprising the step of modifying the values in the 
image cell corresponding to said first image region in 
dependence upon an attribute value of said second 
surface region if and only if a second predefined one 
of said relationships is true. 

14. A method according to claim 13, wherein said 
first predefined relationship is the same as said 
second predefined relationship. 
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15. A method according to claim 13, further 
comprising the step of displaying said displayable two- 
dimensional image on a display. 

16. A method according to claim 15, wherein said 
display includes a two-dimensional array of pixels, and 
wherein said step of displaying displays each of said 
image cells in said frame buffer in a respective 
different one of said pixels. 

17. A method according to claim 1, or use 
further with a texture buffer containing a plurality of 
texture cells, 

wherein the attribute value of said first surface 
region includes a pointer to a group of at least one of 
said texture cells in said texture buffer, 

and wherein said step of modifying the values in 
the image cell corresponding to said first image region 
comprises the steps of: 

retrieving a texture value from one of the 
texture cells pointed to by the attribute value of said 
first surface region; and 

changing the values in the image cell 
corresponding to said first image region in dependence 
upon said texture value. 

18. A method according to claim 17, wherein said 
step of changing the values in the image cell 
corresponding to said first image region in dependence 
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upon said texture value includes the step of changing 
the values in the image cell corresponding to said 
first image region in dependence upon a plurality of 
values including said texture value. 

19. A method according to claim 17, wherein said 
step of changing the values in the image cell 
corresponding to said first image region in dependence 
upon said texture value includes the step of changing 
the values in the image cell corresponding to said 
first image region in dependence upon a plurality of 
values including both said texture value and another 
attribute value of said first surface region. 

20. A method according to claim 17 , wherein said 
step of changing the values in the image cell 
corresponding to said first image region in dependence 
upon said texture value includes the step of changing 
the values in the image cell corresponding to said 
first image region in dependence upon a plurality of 
values including both said texture value and a prior 
value in the image cell corresponding to said first 
image region. 

21. Apparatus for rendering a model from a 
three-dimensional model space into a frame buffer 
containing a plurality of image cells, each of said 
image cells containing, after rendering, values which 
represent visible attributes of a respective region of 
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a displayable two-dimensional image, for use with a 
depth buffer containing a plurality of depth cells 
corresponding to said image cells, comprising: 

a projection provider of an homogenized 
projection of a first surf ace region of said model onto 
an image plane in said model space, said image plane 
having a plurality of image regions each corresponding 
to a respective one of said image cells, said 
homogenized projection including an identification of 
a first one of said image regions for said first 
surface region and a first homogeneity value for said 
first surface region; 

first modification means for modifying the values 
in the image cell corresponding to said first image 
region in dependence upon an attribute value of said 
first surface region; and 

first storing means for storing a first depth 
value which depends on said first homogeneity value, in 
a first one of said depth cells which corresponds to 
said first image region. 

22 . Apparatus according to claim 21 wherein each 
depth cell in said depth buffer corresponds to a 
respective one of the image cells in said frame buffer. 

23. Apparatus according to claim 21, wherein 
said model includes a planar model space polygon having 
a plurality of vertices each having three spatial 
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coordinates in said model space, said polygon including 
said first surface region, wherein said projection 
provider comprises : 

first determining means for determining an 
homogenized projection of each of said vertices onto 
said image plane, said first determining means 
identifying a respective one of said image regions in 
said image plane for each of said vertices and 
generating a respective homogeneity value for each of 
said vertices, the image regions identified for said 
vertices collectively defining an image space polygon 
of said model space polygon, said image space polygon 
covering at least said first image region; and 

second determining means for determining said 
first homogeneity value in dependence upon the 
homogeneity values generated for each of said vertices 
and upon the position of said first image region in 
said image space polygon. 

24 . Apparatus according to claim 23 , wherein 
said image plane is defined by z«z p in said model space, 
said image plane further defining a two-dimensional 
image space having x and y axes parallel to x and y 
axes of said model space, respectively, 

wherein said image space polygon is a triangle 
having first, second and third vertices, said first, 
second and third vertices having homogenized 
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coordinates in said image apace of (5.,L.), and 
* 3 'irJ' res P«ctively, and wherein said first image 
region has homogenized coordinates of where — 

is said first homogeneity value, 

and wherein said second determining means 
comprises means for bi- linearly interpolating JL from 
— , — and _L in accordance with relative distances of 
said first image region from respectively said first, 
second and third vertices. 

25. Apparatus according to claim 23, wherein 
said attribute value of said first surface region 
includes a designation of a first texture region in a 
predefined texture map. 

26. Apparatus according to claim 23, wherein 
said attribute value of said first surface region 
includes a designation of a first texture region in a 
predefined texture map, 

and wherein each of said vertices further has 
associated therewith a respective plurality of spatial 
coordinates in a depth-adjusted texture space, 

further comprising third determining means for 
determining said first texture region in dependence 
upon the depth- adjusted texture space coordinates for 
each of said vertices and the position of said first 
image region in said image space polygon. 
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27. Apparatus according to claim 26, for use 
with a texture map defined in a two-dimensional depth- 
independent texture space with axes u and v, 

wherein the depth- adjusted texture space 
coordinates of said first, second and third vertices 
include (h.Z) and (i # S )f respectively, 

where (JL} # (_L) and are the homogeneity values 

for said first, second and third vertices, 
respectively, and wherein said first texture region has 
depth- adjusted coordinates Zi) 

and wherein said third determining means 
comprises : 

means for bi-linearly interpolating ( h) from 
l Tr ] ' ( v } aadt^) In' accordance with relative 
distances of said first image region from respectively 
said first, second and third vertices; 

means for bi-linearly interpolating. (Zi) from 
( V } and ( "? ) in accordance with relative 
distances of said first image region from respectively 
said first, second and third vertices; 

means for bi-linearly interpolating (-L) from 
( "^ } ' ( "ST } 3X1(1 ( "^ } in accordance with relative 
distances of said first image region from respectively 
said first, second and third vertices; and 
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means for applying (±) to (h.Ii) to calculate 
(u^v^), said first texture region being identified by 
<u f ,v s ) . 

28. Apparatus according to claim 21, wherein 
said first storing means comprises: 

first selecting means for selecting numbers (e,s) 
for said first depth value,. e and s being selected such 
that s x 2" te is equal to said first depth value, b 
being a predefined constant greater than 1; and 

means for storing e and s in said first depth 

cell, 

29. Apparatus according to claim 28 , wherein 

b«3. 

30. Apparatus according to claim 21, wherein 
said first storing means stores said first homogeneity 
value, further comprising: 

second providing means for providing an 
homogenized projection of a second surface region of 
said model onto said image plane, said homogenized 
projection of said second surface region including an 
identification of said first one of said image regions 
for said second surface region and a second homogeneity 
value for said second surface region; 

comparing means for comparing said first 
homogeneity value in said first depth cell with said 
second homogeneity value; and 
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second storing means for storing said second 
homogeneity value in said first depth cell if and only 
if said step of comparing determines that a first 
predefined one of the following relationships is true: 
c 1 < c 2 , c 1 s c 2 , c 1 = c 2 , c t 2 c 2 , c 1 > c 2 and c, £ Cg, 
where c^ is said first homogeneity value and Cj is said 
second homogeneity value. 

31. Apparatus according to claim 30, further 
comprising second modifying means for modifying the 
values in the image cell corresponding to said first 
image region in dependence upon an attribute value of 
said second surface region if and only if a second 
predefined one of said relationships is true. 

32. Apparatus according to claim 31, wherein 
said first predefined relationship is the same as said 
second predefined relationship. 

33. Apparatus according to claim 21, for use 
further with a texture buffer containing a plurality of 
texture cells , 

wherein the attribute value of said first surface 
region includes a pointer to a group of at least one of 
said texture cells in said texture buffer, 

and wherein said first modifying means comprises: 
first retrieving means for retrieving a texture 
value from one of the texture cells pointed to by the 
attribute value of said first surface region; and 
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f irst changing means for changing the values in 
the image cell corresponding to said first image region 
in dependence upon said texture value, 

34. A method for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, for use with a depth 
buffer containing a plurality of depth cells 
corresponding to said image cells, 

wherein said model includes a model space polygon 
having a plurality of model space vertices each having 
three spatial coordinates in said model space and each 
further having attribute values, comprising the steps 
of: 

providing an homogenized projection, of each of 
said model space vertices onto an image plane in said 
model space, said image plane defining a two- 
dimensional image space and having a plurality of image 
regions each corresponding to a respective one of said 
image cells, said homogenized projection including, for 
each of said model space vertices, two spatial 
coordinates in said image space and a homogeneity 
value, the spatial coordinates for each of said 
vertices identifying a respective image space vertex of 
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an image space polygon; 

calculating, by interpolation from said image 
space vertices, a homogeneity value for each of the 
image cells covered by said image space polygon; and 

for each given one of said image cells covered by 
said image space polygon, only if the homogeneity value 
calculated for the given image cell bears a first 
predefined quantitative relationship to the value then 
in the depth cell corresponding to said given image 
cell, modifying the value in the given image cell in 
dependence upon the attribute values of at least one of 
said model space vertices. 

35. A- method according to claim 34, wherein the 
homogeneity value calculated for a certain one of said 
image cells covered by said image space polygon does 
bear said first predefined quantitative relationship to 
the value then in the depth cell corresponding to said 
certain image cell, 

comprising the step of modifying the value in the 
certain image cell in dependence upon the attribute 
values of at least one of said model space vertices. 

36. A method according to claim 35, wherein each 
of said regions of said displayable two-dimensional 
image constitutes a pixel of a display. 

37. A method according to claim 35, wherein said 
predefined quantitative relationship is defined by one 
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of the relationships in the group of relationships 
consisting of < c . , c <= c . c - r 

3 new Vior' '-new < c prtor' c new " C prior 

C ne« * C pnor' C new >=s C prior 311(1 C ney ^ c P Mor' whert is the 

homogeneity value calculated for the given image cell 
and c pHor is the value then in the depth cell 
corresponding to said given image cell. 

38. A method according to claim 35, wherein said 
step of calculating a homogeneity value for each of the 
image cells comprises the step of bi- linearly 
interpolating the homogeneity value for each particular 
one of the image cells covered by said image space 
polygon from the homogeneity values of the model space 
vertices, in accordance with relative distances of the 
particular image cell from each of said image space 
vertices . 

39. A method according to claim 35, wherein the 
attribute values of each of said model space vertices 
include a respective color value, 

further comprising the step of interpolating a 
color value for said certain image cell from color 
values of the model space vertices, in accordance with 
relative distances of said certain image cell from each 
of said image space vertices, 

and wherein said step of modifying the value in 
the certain image cell in dependence upon the attribute 
values of at least one of said model space vertices 
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comprises a step of modifying the value in said certain 
image cell in dependence upon the interpolated color 
value for said certain image cell. 

40. A method according to claim 34, wherein the 
attribute values of each of said model space vertices 
includes a respective plurality of coordinates into a 
texture map space containing a predefined texture map, 

further comprising the step of interpolating a 
respective plurality of texture map coordinates for 
each particular one of the image cells covered by said 
image space polygon from texture map coordinates of the 
model space vertices, in accordance with relative 
distances of the particular image cell from each of 
said image space vertices, 

and wherein said step of modifying the value in 
a given image cell in dependence upon the attribute 
values of at least one of said model space vertices 
comprises a step of modifying the value in the given 
image cell in dependence upon attribute values of said 
texture map for the interpolated plurality of texture 
map coordinates for the given image cell. 

41. A method according to claim 35, wherein the 
attribute values of each of said model space vertices 
includes a respective plurality of spatial coordinates 
into a depth- adjusted texture space containing a 
texture map, said spatial coordinates for each of said 
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model space vertices being depth -adjusted in accordance 
with the homogeneity value for such vertex, further 
comprising the steps of: 

bi- linearly interpolating a respective plurality 
of depth-adjusted texture space coordinates for said 
polygon from the depth- adjusted texture space 
coordinates of the model space vertices, in accordance 
with relative distances of said certain image cell from 
each of said image space vertices; and 

applying the homogeneity value calculated for 
said certain image cell, to the depth-adjusted texture 
space coordinates interpolated for said certain image 
cell to determine a plurality of depth- independent 
texture space coordinates for said certain image cell, 

and wherein said step of modifying the value in 
the certain image cell in dependence upon the attribute 
values of at least one of said model space vertices 
comprises a step of modifying the value in said certain 
image cell in dependence upon attribute values of a 
depth- independent texture map at the depth- independent 
texture space coordinates for said certain image cell. 

42. A method according to claim 35, further 
comprising the step of writing the homogeneity value 
calculated for said certain image cell into the depth 
cell corresponding to said certain image cell. 
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43. A method according to claim 42, wherein said 
step of writing the homogeneity value calculated for 
said certain image cell into the depth cell 
corresponding to said certain image cell comprises the 
steps of: 

representing the homogeneity value calculated for 
said certain image cell with numbers (e,s), where e 
indicates an e'th one of a predefined plurality of sub- 
ranges in a predefined range of all possible depth 
values, said e'th sub-range including discrete depth 
values, and where s indicates an s'th one of the 
discrete depth values within said e'th predefined sub- 
range; and 

storing e and s in said certain image cell. 

44 . A method according to claim 43 , wherein said 
step of representing said first depth value with 
numbers <e,s) comprises the step of selecting e and s 
such that s x 2"** is equal to said first depth value, 
b being a predefined constant greater than l. 

45. A method according to claim 44, wherein b=3. 

46. A method according to claim 35, further 
comprising the steps of: 

representing the homogeneity value calculated for 
said certain image cell with numbers (e^s^), where 
e new indicates an e^'th one of a predefined plurality of 
sub-ranges in a predefined range of all possible depth 
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values, said e^'th sub-range including discrete depth 
values, and where s^ indicates an s^'th one of the 
discrete depth values within said e^'th predefined 
sub- range; 

representing said value then in the depth cell 
corresponding to said certain image cell with numbers 
^prior'Spnor) ' w &ere e prior indicates an e prior 'th one of 
said predefined plurality of sub-ranges in said 
predefined range of all possible depth values, said 
e pnor' th sub- range including discrete depth values, and 
where s pn - or indicates an s prior 'th one of the discrete 
depth values within said e prior 'th predefined sub- range; 
and . 

comparing (e r»' s neJ *ith <e pHor ,s priop ) to determine 
whether the homogeneity value calculated for said 
certain image cell bears said predefined relationship 
to the value then in the depth cell corresponding to 
said certain image cell. 

47. Apparatus for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, for use with a depth 
buffer containing a plurality of depth cells 
corresponding to said image cells, 
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wherein said model includes a model space polygon 
having a plurality of model space vertices each having 
three spatial coordinates in said model space and each 
further having attribute values, comprising: 

a vertex projection provider of an homogenized 
projection of each of said model space vertices onto an 
image plane in said model' space, said image plane 
defining a two-dimensional image space and having a 
plurality of image regions each corresponding to a 
respective one of said image cells, said homogenized 
projection including, for each of said model space 
vertices, two spatial coordinates in said image space 
and a homogeneity value, the spatial coordinates for 
each of said vertices identifying a respective image 
space vertex of an image space polygon; 

first calculating means for calculating, by 
interpolation from said image space vertices, a 
homogeneity value for a given one of the image cells 
covered by said image space polygon; and 

first modifying means for, only if the 
homogeneity value calculated for said given image cell 
bears a first predefined quantitative relationship to 
the value then in the depth cell corresponding to said 
given image cell, modifying the value in said given 
image cell in dependence upon the attribute values of 
at least one of said model space vertices. 



WO 96/36011 



PCT/US96/06653 



-124- 

48. Apparatus according to claim 47, wherein 
said predefined quantitative relationship is defined by 
one of the relationships in the group of relationships 
consisting of c, < c . , c <= c - c = e 

^ new *-prior ' Sww * c prior' c new c p"or 

C new > C prior' C new >= C prior ^ C new * C prior/ where is the 

homogeneity value calculated for said given image cell 
and c P rior is the value then in the depth cell 
corresponding to said given image cell. 

49. Apparatus according to claim 47 , wherein 
said first calculating means comprises wMnq for bi- 
linearly interpolating the homogeneity value for each 
particular one of the image cells covered by said image 
space polygon from the homogeneity values of the model 
space vertices, in accordance with relative distances 
of the particular image cell from each of said image 
space vertices. 

50. Apparatus according to claim 47, wherein the 
attribute values of each of said model space vertices 
include a respective color value, 

further comprising means for interpolating a 
color value for said given image cell from color values 
of the model space vertices, in accordance with 
relative distances of said given image cell from each 
of said image space vertices, 

and wherein said first modifying means comprises 
means for modifying the value in said given image cell 
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in dependence upon the interpolated color value for 
said given image cell. 

51. Apparatus according to claim 47, wherein the 
attribute values of each of said model space vertices 
includes a respective plurality of coordinates into a 
texture map space containing a predefined texture map, 

further comprising means for interpolating a 
respective plurality of texture map coordinates for 
each particular one of the image cells covered by said 
image space polygon from texture map coordinates of the 
model space vertices, in accordance with relative 
distances of the particular image cell from each of 
said image space vertices, 

and wherein said first modifying means comprises 
means for modifying the value in said given image cell 
in dependence upon attribute values of said texture map 
for the interpolated plurality of texture map 
coordinates for said given image cell. 

52. Apparatus according to claim 47, wherein the 
attribute values of each of said model space vertices 
includes a respective plurality of spatial coordinates 
into a depth-adjusted texture space containing a 
texture map, said spatial coordinates for each of said 
model space vertices being depth- adjusted in accordance 
with the homogeneity value for such vertex, further 
comprising: 
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means for bi- linearly interpolating a respective 
plurality of depth- adjusted texture space coordinates 
for said polygon from the depth- adjusted texture space 
coordinates of the model space vertices, in accordance 
with relative distances of said given image cell from 
each of said image space vertices; and 

means for applying the homogeneity value 
calculated for said given image cell, to the depth- 
adjusted texture space coordinates interpolated for 
said given image cell to determine a plurality of 
depth- independent texture space coordinates for said 
given image cell, 

and wherein said first modifying means comprises 
means for modifying the value in said given image cell 
in dependence upon attribute values of a depth- 
independent texture map at the depth- independent 
texture space coordinates for said given image cell. 

53. Apparatus according to claim 47, further 
comprising first writing means for writing the 
homogeneity value calculated for said given image cell 
into the depth cell corresponding to said given image 
cell. 

54. Apparatus according to claim 53, wherein 
said first writing means comprises: 

first selecting means for selecting numbers (e,s) 
for the homogeneity value calculated for said given 
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image cell, where e and s are selected such that s x 2 

be 

xs equal to the homogeneity value calculated for said 
given image cell, b being a predefined constant greater 
than 1; and 

first storing means for storing e and s in said 
given image cell. 

55. Apparatus according to claim 47, further 
comprising: 

first representing means for representing the 
homogeneity value calculated for said given image cell 
with numbers (e^s^), where e^ indicates an e^'th 
one of a predefined plurality of sub- ranges in a 
predefined range of all possible depth values, said 
e new' th sub " range including discrete depth values, and 
where s^ indicates an s^'th one of the discrete depth 
values within said e^'th predefined sub- range; 

second representing means for representing said 
value then in the depth cell corresponding to said 
given image cell with numbers (e^, Spriar J , where e prior 
indicates an e pHor 'th one of said predefined plurality 
of sub- ranges in said predefined range of all possible 
depth values, said e prior 'th sub- range including discrete 
depth values, and where s . indicates an s . 'th one 

r* 1 Wl prior 

of the discrete depth values within said e . 'th 

prior 

predefined sub- range; and 
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first comparing means for comparing (e^s^) with 
* e prior' s prior^ to determine whether the homogeneity value 
calculated for said given image cell bears said 
predefined relationship to the value then in the depth 
cell corresponding to said given image cell. 

56. A method for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, 

wherein said model includes a model space polygon 
having a plurality of model space vertices each having 
three spatial coordinates in said model space and each 
further having attribute values including a respective 
plurality of coordinates into a depth- independent 
texture space having a plurality of texture regions 
each having a texture value, comprising the steps of: 

providing a vertex projection of each of the 
model space vertices of said model space polygon onto 
an image plane in said model space, said image plane 
defining a two-dimensional image space and having a 
plurality of image regions each corresponding to a 
respective one of said image cells, each of said vertex 
projections including a respective relative depth value 
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and a respective plurality of depth -adjusted 
coordinates into a depth -adjusted texture space; 

representing each i'th one of the depth -adjusted 
coordinates of said vertices in said depth- adjusted 
texture space with a respective block range number e, 
and a respective significand s f/ each e, identifying an 
e^th one of a predefined plurality of sub- ranges in a 
predefined overall range of values, said sub -ranges 
including discrete values, and where each s f identifies 
a respective s/th one of the discrete values within the 
e/th predefined sub -range ; 

providing a first projection of a first surface 
region of said model space polygon onto said image 
plane, said first projection including an 
identification of a corresponding first one of said 
image regions in said image space and further including 
a first depth value relative to the depth values of 
said model space vertices; 

representing said first depth value with a block 
range number e, identifying an e/th one of said 
predefined plurality of sub- ranges, and a significand 
s, identifying an s^th one of the discrete values 
within the e/th predefined sub- range; 

calculating, by interpolation from the 
significands only of the depth -adjusted coordinates of 
vertices in said depth-adjusted texture space, 
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significands only of depth- adjusted coordinates of a 
first texture region corresponding to said first 
surface region of said model space polygon; 

applying the significand only of said relative 
depth value for said first image region to the 
significands of said depth-adjusted coordinates of said 
first texture region to determine depth- independent 
coordinates of said first texture region; and 

modifying the value in the image cell 
corresponding to said first image region in dependence 
upon the texture value of said first texture region as 
identified by said depth- independent identification of 
said first texture region. 

57. a method according to claim 56, wherein the 
block range number of the depth- adjusted coordinates of 
all of said vertices in said depth-adjusted texture 
space, and the block. range number of said first depth 
value, are all equal. 

58. A method according to claim 57, wherein said 
sub-ranges are defined such that each j'th one of said 
depth- adjusted coordinates of the vertices in said 
depth- adjusted texture space is given by * 2*** , and 
such that said first depth value is given by s 1 * 2~*\ 
where b is a predefined constant greater than l. 

59. A method according to claim 58, where b-3 . 
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60. A method according to claim 57, further 
comprising the steps of: 

representing the relative depth value of each 
j'th one of said vertex projections with a respective 
block range number ej - and a respective significand Sj , 
each ej identifying a respective e/th one of said 
predefined plurality of sub- ranges. and each Sj - 
identifying a respective Sj'th one of the discrete 
values within the e/th predefined sub- range; and 

calculating the significand of the relative depth 
value for said first image region by interpolation from 
the signif icands only of the depth values of said 
vertex projections, 

wherein the block range number of the relative 
depth values of said vertex projections are all equal. 

61. A method according to claim 56, for use with 
a depth buffer containing a plurality of depth cells 
corresponding to said image cells, further comprising 
the steps of: 

comparing said first depth value to a value 
previously stored in a first one of said depth cells 
which corresponds to said first image region; and 

storing e 1 in conjunction with s, in said first 
depth cell only if a predefined one of the following 
relationships are true: c, < c 2 , c, s c 2 , c, = c 2 , 
c i a °2» c i > c 2 and c, t Cj, where c, is said first depth 
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value and c 2 ia said value previously stored in the 
first depth cell. 

62. A method according to claim 56, for use with 
a depth buffer containing a plurality of depth cells 
corresponding to said image cells, further comprising 
the step of storing e, in conjunction with a, in a first 
one of said depth cells which corresponds to said first 
image region. 

63. Apparatus for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, 

wherein said model includes a model space polygon 
having a plurality of model space vertices each having 
three spatial coordinates in said model space and each 
further having attribute values including a respective 
plurality of coordinates into a depth- independent 
texture space having a plurality of texture regions 
each having a texture value, comprising: 

a vertex projection provider of a vertex 
projection of each of the model space vertices of said 
model space polygon onto an image plane in said model 
space, said image plane defining a two-dimensional 
image space and having a plurality of image regions 
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each corresponding to a respective one of said image 
cells, each of said vertex projections including a 
respective relative depth value and a respective 
plurality of depth -adjusted coordinates into a depth- 
adjusted texture space; 

first representing means for representing each 
i'th one of the depth- adjusted coordinates of said 
vertices in said depth- adjusted texture space with a 
respective block range number e f and a respective 
significand s., each e- identifying an e.'th one of a 
predefined plurality of sub-ranges in a predefined 
overall range of values, said sub- ranges including 
discrete values, and where each s, identifies a 
respective s f 'th one of the discrete values within the 
e f 'th predefined sub -range ; 

a first surface region provider of a first 
projection of a first surface region of . said model 
space polygon onto said image plane, said first 
projection including an identification of a 
corresponding first one of said image regions in said 
image space and further including a first depth value 
relative to the depth values of said model space 
vertices ; 

second representing means for representing said 
first depth value with a block range number e, 
identifying an e/th one of said predefined plurality of 
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sub- ranges, and a significand s, identifying an s,'th 
one of Che discrete values within the e/th predefined 
sub -range; 

first calculating means for calculating, by 
interpolation from the significands only of the depth- 
adjusted coordinates of vertices in said depth- adjusted 
texture space, significands only of depth-adjusted 
coordinates of a first texture region corresponding to 
said first surface region of said model space polygon; 

first applying means for applying the significand 
only of said relative depth value for said first image 
region to the significands of said depth-adjusted 
coordinates of said first texture region to determine 
depth- independent coordinates of said first texture 
region; and 

first modifying means for modifying the value in 
the image cell corresponding to said first image region 
in dependence upon the texture value of said first 
texture region as identified by said depth- independent 
identification of said first texture region. 

64 . Apparatus according to claim 63 , wherein the 
block range number of the depth-adjusted coordinates of 
all of said vertices in said depth- adjusted texture 
space, and the block range number of said first depth 
value, are all equal. 
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65. Apparatus according to claim 64, wherein 
said sub- ranges are defined such that each j'th one of 
said depth- adjusted coordinates of the vertices in said 
depth- adjusted texture space is given by s j * 2'*** , and 
such that said, first depth value is given by s 2 * 2^, 
where b is a predefined constant greater than 1. 

66. Apparatus according to claim 64, further 
comprising: 

third representing means for representing the 
relative depth value of each j'th one of said vertex 
projections with a respective block range number ej and 
a respective significand s lt each e ; identifying a 
respective e-'th one of said predefined plurality of 
sub- ranges, and each s>- identifying a respective Sj'th 
one of the discrete values within the ej'th predefined 
sub -range; and 

second calculating means for calculating the 
significand of the relative depth value for said first 
image region by interpolation from the significands 
only of the depth values of said vertex projections, 

wherein the block range number of the relative 
depth values of said vertex projections are all equal. 

67. Apparatus according to claim 63, for use 
with a depth buffer containing a plurality of depth 
cells corresponding to said image cells, further 
comprising: 
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first comparing means for comparing said first 
depth value to a value previously stored in a first one 
of said depth cells which corresponds to said first 
image region; and 

first storing means for storing e, in conjunction 
with s, in said first depth cell only if a predefined 
one of the following relationships are true: c, < c^, 
C 1 5 02,0, = C2, c, a Cj, c, > 02 and c, c 2 , where c, is 
said first depth value and c 2 is said value previously 
stored in the first depth cell. 

68. Apparatus according to claim 63, for use 
with a depth buffer containing a plurality of depth 
cells corresponding to said image - cells, further 
comprising means for storing e, in conjunction with s, 
in a first one of said depth cells which corresponds to 
said first image region. 

69. A method for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, for use with a depth 
buffer, containing a plurality of depth cells 
corresponding to said image cells, comprising the steps 



Of: 
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providing homogenized projections of a plurality 
of surface regions of said model onto an image plane in 
said model space, said image plane having a plurality 
of image regions each corresponding to a respective one 
of said image cells, said homogenized projections each 
including an identification of a respective different 
one of said image regions for respective ones of said 
surface regions and further including a respective 
homogeneity value; 

representing the homogeneity value for each i'th 
one of said surface regions with a respective block 
range number e. identifying an e.'th one of a predefined 
plurality of sub- ranges in a predefined overall range 
of values, each of said sub- ranges including discrete 
values, and a respective significand s i identifying an 
s.'th one of the discrete values within the e,'th 
predefined sub -range; and 

storing each e { in conjunction with its respective 
s- in the depth cell corresponding to the image cell 
identified for the i'th surface region. 

70. A method according to claim 69, further 
comprising the steps of, for an additional surface 
region of said model which projects onto a particular 
one of said plurality of image regions: 

providing an additional homogenized projection of 
said additional surface region of said model onto said 
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image plane, said additional homogenized projection 
including an identification of said particular image 
region and further including an additional homogeneity 
value ; 

representing said additional homogeneity value 
with an additional respective block range number e, 
identifying an e,'th one of said sub- ranges, and an 
additional significand s, identifying an s,'th one of 
the discrete values within the e,'th predefined sub- 
range; 

comparing said additional homogeneity value to 
the value which, in said step of storing, was 
previously stored in the depth cell corresponding to 
said particular image region; and 

storing e, in conjunction with s, in the depth 
cell corresponding to said particular image region only 
if a predefined one of the following relationships are 
true: c, <: c z , c, s ^ c, = c 2 , c, * c 2 , c, > ^ and 
c, J Cj, where c, is said additional depth value and c 2 
is said value which was previously stored in the depth 
cell corresponding to said particular image region. 

71. A method according to claim 70, further 
comprising the step of modifying the value in the image 
cell corresponding to said particular image region in 
dependence upon an attribute value of said additional 
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surface region, only if said predefined relationship is 
true. 

72. Apparatus for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
visible attributes of a respective region of a 
displayable two-dimensional image, for use with a depth 
buffer containing a plurality of depth cells 
corresponding to said image cells, comprising: 

a surface region projection provider of 
homogenized projections of a plurality of surface 
regions of said model onto an image plane in said model 
space, said image plane having a plurality of image 
regions each corresponding to a respective one of said 
image cells, said homogenized projections each 
including an identification of a respective different 
one of said image regions for respective ones of said 
surface regions and further including a respective 
homogeneity value; 

first representing means for representing the 
homogeneity value for each i'th one of said surface 
regions with a respective block range number e { 
identifying an e,'th one of a predefined plurality of 
sub -ranges in a predefined overall range of values, 
each of said sub- ranges including discrete values, and 
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a respective significance s, identifying an Sj 'th one of 
the discrete values within the e^th predefined sub- 
range; and 

first storing means for storing each e { in 
conjunction with its respective s,. in the depth cell 
corresponding to the image cell identified for the i'th 
surface region. 

73. Apparatus according to claim 72, wherein 
said model has an additional surface region which 
projects onto a particular one of said plurality of 
image regions, further comprising: 

an additional projection provider of an 
additional homogenized projection of said additional 
surface region of said model onto said image plane, 
said additional homogenized projection including an 
identification of said particular image region and 
further including an additional homogeneity value; 

second representing means for representing said 
additional homogeneity value with an additional 
respective block range number e, identifying an e/th 
one of said sub -ranges, and an additional significand 
s, identifying an s,'th one of the discrete values 
within the e/th predefined sub- range; 

first comparing means for comparing said 
additional homogeneity value to a value then in the 
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depth cell corresponding to said particular image 
region; and 

second storing means for storing e 1 in conjunction 
with s, in the depth cell corresponding to said 
particular image region only if a predefined one of the 
following relationships are true: c 1 < c 2 , c 1 s Cj, 
C 1 = °2> c i * c 2' c i > c 2 axxd c i £ ^2* where c 1 is said 
additional depth value and c 2 is said value then in the 
depth cell corresponding to said particular image 
region. 

74 . Apparatus according to claim 73 , further 
comprising means for modifying the value in the image 
cell corresponding to said particular image region in 
dependence upon an attribute value of said additional 
surface region, only if said predefined relationship is 
true . 

75. A graphical rendering system for rendering 
a model defined in a 3 -dimensional model space, into a 
frame buffer having a plurality of image cells, 
different surface regions of said model being rendered 
into different ones of said image cells, 

comprising a depth buffer having a plurality of 
depth cells corresponding to said plurality of image 
cells, the depth cell corresponding to each given one 
of said image cells into which a surface region has 
been rendered containing a value ( 1 ) which is 
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proportional to where z represents a depth in 

said model space of the surface region which was 
rendered into the given image cell, and where a and p 
are constants over all of said ' depth cells 
corresponding to image cells into which surface regions 
have been rendered. 

76. A system according to claim 75, wherein each 
i'th one of said values (1) contained in depth cells 
corresponding to image cells into which surface regions 
have been rendered, is represented in such depth cell 
with a respective block range number e f identifying an 
e.'th one of a predefined plurality of sub- ranges in a 
predefined overall range of values, each of said sub- 
ranges including discrete values, and a respective 
significand s. identifying an s^'th one of the discrete 
values within the e^th predefined sub- range. 

77. A system according to claim 76, wherein each 
i'th one of said values (1) contained in depth cells 
corresponding to image cells into which surface regions 
have been rendered, is given by | -ij =» s i * 2'**' , where 
b is a predefined constant greater than 1. 

78. A system according to claim 76, wherein each 
i'th one of said values (JL) contained in depth cells 
corresponding to image cells into which surface regions 
have been rendered, is given by — = s, * 2~ z&i . 
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79. A method for rendering a model from a 3- 
dimensional model space into a frame buffer containing 
a plurality of image cells, each of said image cells 
containing, after rendering, values which represent 
•visible attributes of a respective region of a 
displayable two-dimensional image, for use with a depth 
buffer containing a plurality of depth cells 
corresponding to said image cells, 

wherein said model includes a model space polygon 
having a plurality of model space vertices each having 
three spatial coordinates in said model space and each 
further having attribute values, comprising the steps 
of: 

providing an homogenized projection of each of 
said model space vertices onto an image plane in said 
model space, said image plane defining a two- 
dimensional image space and having a plurality of image 
regions each corresponding to a respective one of said 
image cells, said homogenized projection including, for 
each of said model space vertices, two spatial 
coordinates in said image space and a homogeneity 
value, the spatial coordinates for each of said 
vertices identifying a respective image space vertex of 
an image space polygon; 

calculating, by interpolation from said image 
space vertices, a homogeneity value for a particular 
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one of the image cells covered by said image space 
polygon; and 

if arid only if a first predetermined condition is 
satisfied, modifying the value in said particular image 
cell in dependence upon the attribute values of at 
least one of said model space vertices, 

said first predetermined condition being 
programmably selectable from among a predefined group 
of conditions consisting of: c, < c 2 , c, s c 2 , c, = c 2 , 
C 1 a °z' c i > c, i c 2 , always and never, where c, is 
the homogeneity value calculated for said particular 
image cell in said step of calculating, and is a 
value previously stored in the depth cell corresponding 
to said particular image cell. 

80. A method according to claim 79, further 
comprising the step of, if and only if . a second 
•predetermined condition is satisfied, storing in the 
depth cell corresponding to said particular image cell 
said homogeneity value calculated for said particular 
image cell in said step of calculating, said second 
predetermined condition being programmably selectable 
from among said predefined group of conditions. 

81. A method according to claim 80, wherein said 
first and second predetermined conditions are 
programmable separately. 

82. a configurable imaging system comprising: 
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(a) system memory means for storing first data 
representing pixels of an under- rendition frame-buffer 
second data representing vertex-based render 
instructions, and third data representing reciprocal 
depth information l/ WpB associated with each pixel of 
said under- rendition frame-buffer; and 

(b) triangle -engine means for extracting from the 
system memory, said vertex-based render instructions 
and for producing therefrom render data for writing 
into the under- rendition frame -buffer. 

83. A method for controlling a real-time 
rendering edge comprising the steps of: 

(a) defining in a first region of system memory, 
a block header having an instructions count field, 
where the block header is to be processed before 
corresponding instructions, where the block header 
further defines a format for the corresponding 
instructions ; « tv1 

(b) defining in a second region of said memory 
flow- control instructions. 
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Remove frameworks of hidden surfaces 
Construct vertex-based draw lists 
Set TE to work on desired lists 
Intervene on TE workflow when necessary 
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TRIANGLE ENGINE TASKS 
Fetch worklist 

Build triangle (3 vertices, strip, fan) 

Sort vertices (*) 

Compute walker gradients (*) 

Walk along edges in image space 

Walk across span in image space 

Map to texture plane k fetch texture 

Blend texture into interpolated span color 

Blend current FB color into mix 

Test depth <k cond write of new FB color 

Do next triangle 
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VERTEX-INSTR #21 
VERTEX-INSTR #22 
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VERTEX-INSTR #2M 



INVALID-INSTR 



END OF LIST 



DONE 



TO 
DO 



UNDER 
CONST 



SlIBSTmnE SHEET (RULE 28 



WO 96/36011 



8/22 



PCT/US96/06653 




o 



SUBSTITUTE SHEET (RULE 261 



WO 96/36011 



PCT/US96/06653 



9/22 





SUBSTITUTE SHEET (RULE 26} 



WO 9606011 PCT/US96/06653 

10/22 



1 



CD 



in 

tn 
in 

CO 

II 



o 



§1 



00 
CO 

CM 



o 
II 



SUBSTITUTE SHEET (RULE 26} 



WO 96/36011 



11/22 



PCT/US96/06653 




>»1 



£5 



SUBSTITUTE SHEET (RULE 261 



WO 96/36011 



12/22 



PCT/US96/06653 




SUBSTITUTE SHEET (RULE 2(fi 



WO 96/36011 



13/22 



PCT/US96/06653 




SUBSTITUTE SHEET (RULE 26^ 



WO 96736011 



14/22 



PCT/US96/06653 




SUBSTITUTE SHEET (RULE 261 



WO 96/36011 



PCT/US96/06653 



15/22 



CONTROL 
TO/FROM 
VTX 




DATA 
TO EWK 



FIG. 15 (STP) 

SUBSTITUTE SHEET (RULE 26) 



WO 96/36011 



16/22 



PCT/DS96/066S3 




gflDCT tn t 



WO 96/36011 



17/22 



PCTAJS96/06653 




FIG. 17 



4 



SUBSTITUTE SHEET (RULE 26} 



WO 96/36011 



18/22 



PCT/US96/06653 



alphaNew 

b lueNew 
greenNew 
iredNew 



s lope_aNew 
slope.bNew 
slope.gNew 
slope_rNew 



dadxNew 
dbdxNew 
dgdxNew 
drdxNew 



-1B12 




ewk.swkColorDdx 



Clamp ing 



ewk.swkCo lorVal 



FIG. IB 



SUBSmUTE SHEET (RULE » 



WO 96/36011 



19/22 



PCT/US96/06653 




SUBSTITUTE SteffiULF 26V 



WO 96/36011 



20/22 



PCT/DS96/066S3 



(U/W), (V/W), 









Wi 

~1 — . 


! 




i 

1 V : 



multipl 
16x20 



10.6 



2024 
-+10.6 



7 



7 



■2002 



10.26< 



^normalizer/ 



s-2004 

r200B 



v 2020 



10.6 < 



y^200B 



L 



2022 



10.4 < 



r 2010 



10 .4 < 



u v Datapath 
(U) 




(V)| 



10.4- A 10.4^ JfoA 
U1 V, U 0 V 0 



1910-U 
1910-V 
1910 



FIG. 20 



SUBSTITUTE SHEET (RULE 26) 



WO 96/36011 



21/22 



PCT/DS96/06653 



I EXAMPLE PROCESS") 



L 



2102 



APPLICATION PROGRAM CREATES MODEL IN MODEL SPACE AND 
DIVIDES ALL POLYGONS INTO TRIANGLES. EACH VERTEX HAS 3 
SPATIAL COORDINATES IN MODEL SPACE, 4 COLOR VALUES (R.G.B.A), 
AND 2 COORDINATES (u,v) INTO A DEPTH-INDEPENDENT TEXTURE MAP. 



DONE 
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APPLICATION PROGRAM LOADS TEXTURE BUFFER WITH DESIRED 

TEXTURE MAP 
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APPLICATION PROGRAM TRANSFORMS MODEL AS DESIRED AND 
PROJECTS MODEL ONTO IMAGE PLANE USING 4X4 MATRICES AND 
HOMOGENOUS COORDINATES. NATURALLY YEILDS HOMOGENEITY 
DIVISOR w FOR EACH VERTEX. 



APPLICATION PROGRAM DIVIDES ALL SPATIAL VERTEX COORDINATES 

BY w FOR THE VERTEX, YIELDING HOMOGENIZED SPATIAL 
COORDINATES (u/w, v/w) INTO AN HYPOTHETICAL DEPTH-ADJUSTED 

TEXTURE SPACE. 
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APPLICATION PROGRAM PASSES VERTEX INSTRUCTION LISTS TO 
TRIANGLE ENGINE , USING FLOATING POINT REPRESENTATIONS OF x/w, 
y/w, u/w, v/w AND 1/w FOR EACH VERTEX . 
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FIRST A 



SETUP ENGINE SORTS VERTICES AND COMPUTES WALKER GRADIENTS 

FOR FIRST TRIANGLE. SETUP ENGINE COMPARES THE THREE 1/w 
VALUES TO DETERMINE OPTIMAL w-RANGE THAT CAN BE APPLIED TO 
ALL THREE 1/w VALUES, AND BLOCK FIXES ALL THREE 1/w VALUES 
ACCORDINGLY.. SETUP ENGINE ALSO BLOCK FIXES u/w AND v/w 
VALUES INTO THE SAME w-RANGE. 
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NEXT 
PIXEL 
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EDGE WALKER WALKS ALONG EDGES OF THE CURRENT TRIANGLE IN 
IMAGE SPACE, AND SPAN WALKER WALKS ACROSS EACH SPAN. 



NOT DONE 



DONE 
£2116 



AT FIRST (NEXT) PIXEL TRAVERSED IN IMAGE SPACE, VALUES FOR THE 

FOLLOWING ATTRIBUTES ARE INTERPOLATED BY BI-LINEAR 
INTERPOLATION BASED ON THE CORRESPONDING ATTRIBUTE VALUES 
AT THE TRIANGLE VERTICES: COLOR (R.G.B.A); DEPTH-ADJUSTED 
TEXTURE SPACE COORDINATES (u/w, v/w) AND AN HOMOGENEITY 
FACTOR (1/w). ONLY BLOCK FIX SIGNIFICANDS ARE USED IN THE 
INTERPOLATION OF DEPTH-ADJUSTED TEXTURE SPACE COORDINATES 
AND HOMOGENEITY FACTORS, AND ONLY SIGNIFICANDS ARE 
INTERPOLATED FOR THESE VALUES. 
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SPAN WALKER THEN DIVIDES THE DEPTH-ADJUSTED TEXTURE SPACE 
COORDINATES (u/w, v/w) BY THE HOMOGENEITY VALUE (1/w) 
CALCULATED FOR THE PIXEL, TO COMPUTE DEPTH-INDEPENDENT 
TEXTURE SPACE COORDINATES (u,v) FOR THE PIXEL. 
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TEXTURE MAPPER RECEIVES (u.v) AND RESPONDS WITH TEXTURE 
COLOR VALUES {R.G,B,A} T FOR THE SPECIFIED COORDINATES OF DEPTH- 
INDEPENDENT TEXTURE SPACE. 



(FARTHER) 
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DOES THE NEWLY CALCULATED (1/w) FOR THE PIXEL REPRESENT A 
DEPTH NEARER OR FARTHER IN MODEL SPACE FROM THE IMAGE PLANE, 
THAN THE (1/w) VALUE PREVIOUSLY IN THE DEPTH BUFFER FOR THAT 

PIXEL? 



NEARER 
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BLEND NEWLY CALCULATED COLOR VALUE FOR THE PIXEL, THE COLOR 
VALUE PROVIDED BY THE TEXTURE MAPPER FOR THE PIXEL, AND COLOR 
VALUES FOR THE PIXEL AS PREVIOUSLY IN THE FRAME BUFFER. 



NO 
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IS BLENDED COLOR 
VALUE OPAQUE? 
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YES 
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TRUNCATE SIGNIFICAND OF HOMOGENEITY CONSTANT (1/w) FOR THIS 
PIXEL TO 14-BITS AND WRITE INTO DEPTH CELL WITH 2-BIT w-RANGE 

NUMBER. 
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